THE PHILOSOPHY
OF QUANTUM MECHANICS


PREFACE 


The present monograph is devoted to the principal problems of quantum 
mechanics and is based on the conception first stated in my course on 
‘Fundamentals of Quantum Mechanics’. The scope and purpose of the 
above course did not allow some principal questions to be brought out 
as fully as they deserved, and besides, some important points were only 
very recently developed to a sufficient extent. This refers especially to the 
analysis of the action of the measuring instrument, whose dual role as 
an analyser of a quantum ensemble and as a detector of individual events 
was insufficiently elucidated. 

The reader will find that the present monograph is concerned more 
with theoretical physics than with philosophy. However, I have never 
separated Weltanschauung from science (and particularly theoretical 
physics) so that the philosophical implications are also discussed, justi- 
fying publication in the philosophical series. 

In conclusion, I should like to thank the publisher and the translator, 
whose initiative and effort have made it possible for the book to reach 
the English-speaking reader. 
CHAPTER I


THE ILLUSION OF DETERMINISM 


“Necessity of this kind does not take us outside the 
theological view of nature. For science it is largely a 
matter of indifference whether we follow St. Augustine 
or Calvin and term it God’s will, or the Turks and ascribe 
it to kismet or fate. In neither case are we tracing the 
causal chain, and hence in neither case are we any the
wiser. Necessity remains an empty phrase, while the 
situation remains as it was.” 


ENGELS, Dialectics of Nature 


It might be said more fully that the illusion of determinism began with
the proud though empty boast of Laplace: “Give me the initial data on
all particles and I will predict the future of the Universe”.

When we recall these resounding words the time of that great 18th- 
century scientist appears as far off as though it were viewed through the 
wrong end of binoculars; today we are more humble and very far from 
the hopes of the mechanistic age. Even the early years of this century, which 
have destroyed the naive illusions of the 19th century when it seemed 
that all was clear and that science had run out of problems, seem very 
remote. 

Today we are facing a new frontier, probably a highly revolutionary 
one, whose outlines appear immense though still largely shrouded in mist. 

It may be that this is merely our impression, derived from a desire to
forgo former principles that are now clearly outdated but that we retain
solely because we have no new ones, and even because many fear the change
with its consequent upheaval of our entire way of thought.

The most striking feature of this trend is that it encompasses not only 
physics but also biology and cosmology; it is capable of having a great 
influence on our philosophy. Under these conditions it is entirely in order 
to take a critical view of the bases of the theory. 

These bases include the concept of classical determinism. It would be 
more accurate to say that the doubt is not so much over determinism 
itself as over unquestioning worship of ideal determinism. 
For a long time humanity believed in divine predestination, and afterwards
in rigid causal connection. Engels appreciated the philosophical
resemblance and narrowness of these viewpoints, while failure to appreciate
this affinity has over the centuries been the cause of tragedy to many
outstanding men.

The desire to know the truth is as old as the human race, and this 
desire must thrust aside any feeling of caution over the possible loss of 
the security provided by customary concepts. 

We must now accept that we cannot ignore the element of games of 
chance in the behaviour of the Universe; if we were to believe in a God 
or other guiding hand, we should still have to accept that this God (or 
the equivalent) has a penchant for games of chance. His Majesty Chance 
enjoys the explicit indulgence of the Law and makes our events un- 
expected or improbable; there is even a range of phenomena where he 
feels himself particularly at liberty, namely atomic and molecular physics. 

Boltzmann, a great physicist and materialist, first remarked on this 
important feature and put forward the ingenious hypothesis that the 
entire region of the universe accessible to us is the result of a vast fluctu- 
ation such as has never been seen before or since. 

We must accept that the world is not built in the tidy or simple fashion 
envisaged by believers in the various metaphysical systems. 

To appreciate this concept it is best to begin with the defects in the 
ideal we hanker for, and this is the starting point of the present book. 

The principle of determinism is particularly clear in classical mechanics, 
which seeks to define completely the future state of a system on the basis 
of the initial state. 


CHAPTER II 


CLASSICAL MECHANICS AND CAUSALITY 


Classical mechanics is the simplest example of a theory dominated 
throughout by determinism. Even if we do not learn it at school, we are 
taught in our first year at university that the laws of classical mechanics 
allow one to predict the future of a mechanical system whose initial data 
are available. This appears a trivial truth but in fact contains much that 
is false or at least debatable. For example, the initial data cannot be 
specified with unlimited accuracy, so we really have a certain distribution 
for the initial data, the true values being known only with a certain 
probability. 

This means that we must consider how far this indeterminacy affects
our prediction of the state of the system as $t \to \infty$.

Moreover, the system is influenced by unpredictable accidental forces 
during its motion. Even if these forces are small, their effect may become 
considerable after prolonged periods. The effects of such random forces 
cannot be eliminated from the real motion. 

Finally, we must ensure that the system will remain isolated during the 
time for which we are to predict its future. This means that we must 
specify the boundary conditions at the boundaries of the region within 
which the motion occurs. All three of these aspects are usually ignored in
the exposition of classical mechanics.

Before considering the effects of these circumstances on the predictions 
derived from the laws of classical mechanics, a few words must be said 
on two methods of describing the motion of systems in classical mechanics. 
Consider a mechanical system with $f$ degrees of freedom. Its state is
characterized by the values $q_1, q_2, \dots, q_f$ (in short, $q$) of the
coordinates and by the values of the momenta $p_1, p_2, \dots, p_f$ (in
short, $p$) conjugate with these. The manifold of points $q$ forms the
configuration space $R(q)$, while that of the points $p$ forms the momentum
space $R(p)$. The name phase space is given to $R(q, p)$, the two spaces
taken jointly, $R(q) \times R(p)$. The state of the system in classical
mechanics is characterized by a point in phase space.
The first mode of description is that in which we consider a system
which at time $t = 0$ lies at the point $q = q_0$, $p = p_0$ of phase space,
for which we compute the trajectory $q = Q(t, q_0, p_0)$, $p = P(t, q_0, p_0)$
in the space $R(q, p)$. This trajectory is computed from Hamilton’s equations
(refs. 1 and 2), which describe the motion in phase space:

$$\dot{q}_s = [H, q_s], \qquad \dot{p}_s = [H, p_s] \tag{1}$$

with $s = 1, 2, \dots, f$. Here $H = H(q, p, t)$ is Hamilton’s function, which
equals the sum $T + U$ of the kinetic energy $T(p, q)$ and the force function
$U(p, q, t)$ characterizing the interactions of the particles mutually and
with the external world. In the special case of $\partial U/\partial t = 0$
the force function is the potential energy. Also, $T(p, q)$ is quadratic in
$p$. Further, $[A, B]$ is the classical Poisson bracket, defined as

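$$[A, B] = \sum_{s=1}^{f}\left(\frac{\partial A}{\partial p_s}\frac{\partial B}{\partial q_s} - \frac{\partial B}{\partial p_s}\frac{\partial A}{\partial q_s}\right), \tag{2}$$

so that

$$[H, q_s] = \frac{\partial H}{\partial p_s}, \qquad [H, p_s] = -\frac{\partial H}{\partial q_s}. \tag{3}$$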


System (1) describes completely the motion of an isolated mechanical 
system. 

The other mode of description is more convenient when the initial data 
for the system are undefined. 

Consider a large number of identical systems in different states. The
points representing these are distributed in phase space, and under certain
conditions (which we shall not consider in greater detail) we may speak
of the density $\rho(q, p, t)$ of these points, so that
$\rho(q, p, t)\, dp\, dq/(2\pi\hbar)^f$ is the number of systems whose
coordinates and momenta lie around the point $(p, q)$ at instant $t$. Here
$dp\, dq = dp_1\, dp_2 \dots dp_f\, dq_1\, dq_2 \dots dq_f$ is an element of
volume in phase space, while the factor $(2\pi\hbar)^f$ is taken as the unit
of phase volume. Since the number of systems is conserved, $\rho(q, p, t)$
must satisfy the equation of continuity in phase space, i.e.,

$$\frac{\partial \rho}{\partial t} + \sum_{s=1}^{f}\left[\frac{\partial}{\partial q_s}(\rho\, \dot{q}_s) + \frac{\partial}{\partial p_s}(\rho\, \dot{p}_s)\right] = 0. \tag{4}$$
Then (1) and (3) allow us to put this in the form

$$\frac{\partial \rho}{\partial t} + [H, \rho] = 0. \tag{5}$$

On the other hand, the complete derivative of $\rho$ with respect to time is
$d\rho/dt = \partial\rho/\partial t + [H, \rho]$. Therefore Liouville’s
theorem follows from (5):

$$\frac{d\rho}{dt} = 0, \tag{6}$$

which shows that $\rho$ is constant along a trajectory.
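The content of Liouville's theorem can be checked numerically. The following is a minimal sketch, not part of the original text: it integrates Hamilton's equations for one degree of freedom with the leapfrog scheme and compares the phase-space area of a small triangle of initial points before and after the motion. The pendulum force, the step size, and all function names are illustrative assumptions.

```python
import math

def leapfrog(q, p, force, m, dt, steps):
    """Integrate dq/dt = p/m, dp/dt = force(q) with the leapfrog scheme."""
    for _ in range(steps):
        p += 0.5 * dt * force(q)   # half kick
        q += dt * p / m            # drift
        p += 0.5 * dt * force(q)   # half kick
    return q, p

def triangle_area(points):
    """Area of a triangle given three (q, p) points."""
    (q1, p1), (q2, p2), (q3, p3) = points
    return 0.5 * abs((q2 - q1) * (p3 - p1) - (q3 - q1) * (p2 - p1))

def area_ratio(eps=1e-4, m=1.0, dt=0.01, steps=500):
    """Ratio of final to initial area for a small triangle of phase points."""
    # Assumed example system: a pendulum with U(q) = -cos(q).
    force = lambda q: -math.sin(q)
    start = [(1.0, 0.0), (1.0 + eps, 0.0), (1.0, eps)]
    end = [leapfrog(q, p, force, m, dt, steps) for q, p in start]
    return triangle_area(end) / triangle_area(start)

if __name__ == "__main__":
    print(area_ratio())
```

The ratio stays close to unity although the triangle is sheared and displaced, which is the geometrical counterpart of the density being constant along a trajectory.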

In this mode of description we are given at $t = 0$ not the initial data
but the function $\rho_0 = \rho(q, p, 0)$, which essentially describes the
distribution of the initial values of $p$ and $q$; what we seek is the
distribution of $p$ and $q$ for $t > 0$. This is equivalent to considering
the motion of a large set of independent mechanical systems that differ
only in their initial data.


A. EFFECTS OF INITIAL DATA 


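We may avoid obscuring essentials with complicated expressions by
considering only the case $f = 1$. First we consider the case of free motion,
for which $T = p^2/2m$ and $U = 0$, $m$ being the mass of a particle. In this
case (5) gives

$$\frac{\partial \rho}{\partial t} + \frac{p}{m}\frac{\partial \rho}{\partial q} = 0, \tag{7}$$

whose general solution is

$$\rho(q, p, t) = \rho_0\left(q - \frac{p}{m}\, t,\; p\right), \tag{8}$$

in which $\rho_0(q, p)$ is the initial distribution of $q$ and $p$. A particular
case is an initial Gaussian distribution:

$$\rho_0(q, p) = \frac{1}{\pi a b}\exp\left(-\frac{(p - p_0)^2}{a^2} - \frac{(q - q_0)^2}{b^2}\right). \tag{9}$$

This has the variances (mean-square deviations)

$$\overline{\Delta p^2} = \overline{(p - \bar{p})^2} = \tfrac{1}{2}a^2 \quad\text{and}\quad \overline{\Delta q^2} = \overline{(q - \bar{q})^2} = \tfrac{1}{2}b^2.$$

Then, for $t > 0$, (8) gives

$$\rho(q, p, t) = \frac{1}{\pi a b}\exp\left(-\frac{(p - p_0)^2}{a^2} - \frac{\left(q - (p/m)\, t - q_0\right)^2}{b^2}\right). \tag{10}$$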
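It therefore follows that the distribution in $p$ for $t > 0$ is

$$\rho(p, t) = \int \rho(q, p, t)\, dq = \frac{1}{\sqrt{\pi}\, a}\exp\left(-\frac{(p - p_0)^2}{a^2}\right), \tag{11}$$

while that in $q$ is

$$\rho(q, t) = \int \rho(q, p, t)\, dp = \frac{1}{\sqrt{\pi}\sqrt{b^2 + (a^2 t^2/m^2)}}\exp\left(-\frac{\left(q - q_0 - (p_0/m)\, t\right)^2}{b^2 + (a^2/m^2)\, t^2}\right). \tag{12}$$

Thus the $p$ distribution is unchanged, while the $q$ one is altered,
$\overline{\Delta q^2}$ increasing with time:

$$\overline{\Delta q^2} = \frac{1}{2}\left(b^2 + \frac{a^2}{m^2}\, t^2\right). \tag{13}$$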
2 m 
Thus the initial information about the position of a particle is completely
lost at times $t > mb/a$; particles differing in spatial position at $t = 0$
later become mingled. The information about the position of the particles in
space continues to deteriorate with the passage of time.
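The growth of the coordinate variance stated in (13) can be illustrated by direct sampling. The sketch below is an illustration under stated assumptions rather than part of the original argument: it draws initial values of $q$ and $p$ from Gaussian distributions with variances $b^2/2$ and $a^2/2$, moves each sample freely for a time $t$, and compares the sample variance of $q$ with the prediction. The function names, the sample size, and the seed are arbitrary choices.

```python
import math
import random

def spread_variance(a, b, m, t, q0=0.0, p0=1.0, n=100_000, seed=0):
    """Sample variance of q at time t for free particles with Gaussian initial data."""
    rng = random.Random(seed)
    sigma_q = b / math.sqrt(2.0)   # initial variance of q is b^2 / 2
    sigma_p = a / math.sqrt(2.0)   # variance of p is a^2 / 2
    qs = []
    for _ in range(n):
        q_init = rng.gauss(q0, sigma_q)
        p = rng.gauss(p0, sigma_p)
        qs.append(q_init + p / m * t)   # free motion: q(t) = q(0) + (p/m) t
    mean = sum(qs) / n
    return sum((q - mean) ** 2 for q in qs) / n

def predicted_variance(a, b, m, t):
    """Variance of q according to Eq. (13)."""
    return 0.5 * (b * b + (a * t / m) ** 2)

if __name__ == "__main__":
    a, b, m = 0.5, 1.0, 2.0
    for t in (0.0, 4.0, 8.0):
        print(t, spread_variance(a, b, m, t), predicted_variance(a, b, m, t))
```

For $t$ well beyond $mb/a$ the second term of (13) dominates, so the width of the distribution is governed almost entirely by the momentum spread and no longer by the initial positional accuracy $b$.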

According to M. Born, this conclusion is applicable to a much broader
class of systems, namely quasiperiodic ones, in which we may introduce
the so-called cyclic variables $\varphi_s$ and the corresponding actions
$J_s$ (see for example refs. 1 and 2). The Hamiltonian does not depend on
the cyclic variables and is a function of $J$ alone; Hamilton’s equations
in these terms state (for one degree of freedom) that

$$\dot{J} = -\frac{\partial H}{\partial \varphi} = 0, \qquad \dot{\varphi} = \frac{\partial H}{\partial J} = \omega, \tag{14}$$

so that

$$\varphi = \omega t + \varphi_0, \qquad \omega = \text{const}. \tag{15}$$

Here $q$ is expressed in terms of $\varphi$ and $J$ as a Fourier series:

$$q(t) = \sum_{n=-\infty}^{\infty} A_n e^{i n \varphi}, \tag{16}$$

in which the amplitudes $A_n$ are functions of $J$. Equations (15) coincide
formally with the equations for the free motion of a particle:

$$q = \frac{p}{m}\, t + q_0, \qquad \frac{p}{m} = \frac{\partial H}{\partial p} = \text{const}.$$


If $\omega$ and $\varphi$ are initially Gaussian, the conclusions drawn for
free motion apply in their entirety to quasiperiodic motion. Here the role
of the momentum (or rather, the velocity $p/m$) is taken by the angular
velocity $\omega$, while the angle $\varphi$ replaces $q$. It is clear that
the information about the position of the system will be completely lost if
the uncertainty in $\varphi$ exceeds $2\pi$. Eq. (13), in which $a$ represents
the uncertainty in $\omega$ and $b$ represents the initial uncertainty in
$\varphi$, shows that this occurs for $t > 2\pi/a$.

We may note that exactly the same situation occurs in quantum mechanics,
but with the essential difference that a very narrow initial distribution
in $q$ inevitably leads to a particularly rapid broadening of the
distribution. Thus we see that, for aperiodic motion, there is always a
time (namely $t > bm/a$) beyond which the uncertainty in the coordinate
becomes greater than the initial uncertainty specified via $b$, and hence
destroys the accuracy of that information.

The information on the position is completely lost for $t > 2\pi/a$ in the
case of periodic motion.


B. RANDOM FORCES 


In the previous section we discussed the effects of uncertainty in the 
initial data on the future predictability of the state of a mechanical 
system. However, during its motion a particle may additionally be subject 
to forces that cannot be predicted exactly. Such forces are inevitable under 
real conditions, although they may be extremely small. For example, they 
may arise from collisions between the body and the molecules of the 
medium, from turbulence or from any other random inhomogeneity in 
the medium. 

It is impossible to give a theory of these effects for the general case, 
although they have been the subject of searching investigation by mathe- 
maticians. However, it is not too difficult to allow for the effects of these 
forces if they are small relative to the principal forces or if the main 
motion is described by linear equations. This is the type of case we shall 
consider below. 
We put the equation of motion in the Newtonian form

$$m\ddot{q} = F(q, t), \tag{17}$$

and assume that the force $F(q, t)$ may be resolved into a regular part
$F(q)$ and a random part $f(t)$, the latter being dependent on time alone.
We also resolve the coordinate $q$ as $Q + \Delta q$, in which $\Delta q$ is
the deviation caused by the random force $f$. Neglecting the higher powers
of $\Delta q$ in (17), we obtain a linear equation for $\Delta q$:

$$\Delta\ddot{q} - \frac{1}{m}\frac{\partial F}{\partial q}\,\Delta q = \frac{1}{m} f(t). \tag{18}$$

This equation may be solved via a Green’s function $G(t - t')$, which
satisfies the equation

$$\ddot{G}(t - t') - \frac{1}{m}\frac{\partial F}{\partial q}\, G(t - t') = \delta(t - t') \tag{19}$$

and is subject to the boundary condition

$$G(t - t') = 0 \quad\text{for}\quad t - t' < 0.$$

We use the Green’s function to seek a solution of (18) in the form

$$\Delta q = \frac{1}{m}\int_0^t G(t - t')\, f(t')\, dt'. \tag{20}$$


The following notation is used for the mean of the product of the
random forces acting at different instants $t$ and $t'$:

$$a^2 A(t - t') = \overline{f(t)\, f(t')}, \tag{21}$$

in which $a^2$ is a dimensional coefficient. This quantity is essentially the
value of the autocorrelation function of $f(t)$ for the two instants $t$ and
$t'$.

From (20) and (21) we have $\overline{\Delta q^2}$ as

$$\overline{\Delta q^2} = \frac{a^2}{m^2}\int_0^t dt' \int_0^t dt''\, G(t - t')\, G(t - t'')\, A(t' - t''). \tag{22}$$

We assume that the autocorrelation function $A(t)$ decreases rapidly,
becoming essentially zero for $|t| > \tau$. If the Green’s function varies
smoothly in the interval up to $\tau$, we may integrate with respect to $t''$
and normalize $A(t)$ via

$$\int_{-\infty}^{+\infty} A(t' - t'')\, dt'' = 1,$$

the result then being

$$\overline{\Delta q^2}(t) = \frac{a^2}{m^2}\int_0^t dt'\, G^2(t - t'). \tag{23}$$


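We apply this formula to the linear equation

$$\ddot{q} + \lambda\dot{q} + \omega^2 q = \frac{1}{m} f(t), \tag{24}$$

in which $\lambda$ is a frictional coefficient, $\omega$ is the natural
frequency, and $f(t)$ is the random force. The Green’s function for this
equation is

$$G(t) = \frac{\exp\left[-\left(\tfrac{\lambda}{2} - \sqrt{\tfrac{\lambda^2}{4} - \omega^2}\right) t\right] - \exp\left[-\left(\tfrac{\lambda}{2} + \sqrt{\tfrac{\lambda^2}{4} - \omega^2}\right) t\right]}{2\sqrt{\tfrac{\lambda^2}{4} - \omega^2}} \quad\text{for}\quad t > 0 \tag{25}$$

and

$$G(t) = 0 \quad\text{for}\quad t < 0. \tag{25'}$$

Here we may consider two cases.

a) Free motion with friction: $\omega = 0$, $\lambda \neq 0$. From (23) and
(25), for $t \to \infty$,

$$\overline{\Delta q^2}(t) = \frac{a^2}{m^2\lambda^2}\int_0^t \left(1 - e^{-\lambda t'}\right)^2 dt' = \frac{a^2}{\lambda^2 m^2}\, t + \dots \tag{26}$$

b) Harmonic oscillator with friction: from (23) and (25), for $t \to \infty$,

$$\overline{\Delta q^2}(t) = \frac{a^2}{2\lambda m^2\omega^2} + \dots \tag{27}$$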
The corresponding results for zero friction are

$$\overline{\Delta q^2}(t) = \frac{a^2}{3m^2}\, t^3, \tag{26'}$$

$$\overline{\Delta q^2}(t) = \frac{a^2}{2m^2\omega^2}\, t + \dots \tag{27'}$$


These examples show that random forces may have pronounced effects 
on the predictability of the motion in classical mechanics. 
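The results for the oscillator can be checked by evaluating the integral in (23) numerically. The sketch below is an illustration, not the author's calculation: it assumes the underdamped case $\omega > \lambda/2$, writes the retarded Green's function of (24) in the equivalent trigonometric form $e^{-\lambda t/2}\sin(\Omega t)/\Omega$ with $\Omega^2 = \omega^2 - \lambda^2/4$, and applies the trapezoidal rule. Function names and parameter values are arbitrary.

```python
import math

def green(t, lam, omega):
    """Retarded Green's function of q'' + lam*q' + omega^2*q = delta(t).

    Assumes the underdamped case omega > lam / 2.
    """
    if t < 0.0:
        return 0.0
    big_omega = math.sqrt(omega * omega - lam * lam / 4.0)
    return math.exp(-lam * t / 2.0) * math.sin(big_omega * t) / big_omega

def variance(t, a, m, lam, omega, n=20_000):
    """Eq. (23): (a^2 / m^2) * integral of G(s)^2 over 0 <= s <= t (trapezoidal rule)."""
    h = t / n
    total = 0.5 * (green(0.0, lam, omega) ** 2 + green(t, lam, omega) ** 2)
    for i in range(1, n):
        total += green(i * h, lam, omega) ** 2
    return (a * a) / (m * m) * total * h

if __name__ == "__main__":
    a, m, lam, omega = 1.0, 1.0, 0.2, 1.0
    print(variance(200.0, a, m, lam, omega))          # long-time value with friction
    print(a * a / (2.0 * lam * m * m * omega ** 2))   # Eq. (27)
    print(variance(10.0, a, m, 0.0, omega))           # zero friction: grows with t
```

With friction the variance saturates at the value given by (27), whereas with $\lambda = 0$ the same integral grows without limit, its leading term being linear in $t$.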


C. BOUNDARY CONDITIONS 


It is not usual to speak of boundary conditions in mechanics; rather, the 
conditions of isolation of the system are considered. By this is meant 
that the system is subject to no forces other than those appearing in the 
equation of motion, or that these do not act on the system during the 
time interval of interest to us. 

In effect, it is assumed that a surface forms the boundary of the region
of space in which the motion occurs, this surface being impenetrable to
external fields or bodies. Figure 1 shows such a surface for the case
of one-dimensional motion along the OX axis. The system is isolated if
we can ensure that no external field or body enters through the surfaces
S′ and S″. For example, a planetary system may be treated as closed
only because we can be sure that no unforeseen heavenly body will enter
it or pass near it within the period of interest to us.

Fig. 1. Initial data are given for the surface σ′ (part a′b′), while the boundary data
are given for surfaces S′ and S″ (parts a′a″ and b′b″). The line OO′ is the trajectory of
the particle.

This feature is extremely important, because it shows that a deter- 
ministic prediction of the future is itself conditional: the future of a 
mechanical system may be predicted only if we can be sure that the 
system is isolated. The guarantee required here is not implied by the 
equations of motion but is an additional condition, which produces a 
great reduction in the reliance on determinism. A vast and depressing
“if” arises in the way of the prophet who sets out to predict the future
of a real mechanical system.


D. SOME REMARKS ON FIELDS 


Consider now the same problem as regards a field that obeys a linear
equation; for simplicity, let us limit ourselves to a scalar field
$\varphi = \varphi(x, t)$, which obeys

$$\frac{\partial^2 \varphi}{\partial t^2} - \nabla^2\varphi - \kappa^2\varphi = 0. \tag{28}$$

The general solution to this may be expressed in terms of the initial
data $\varphi(x, 0)$ and $\partial\varphi(x, 0)/\partial t$, together with the
boundary values of $\varphi(x, t)$ and $\partial\varphi(x, t)/\partial n$, in
which $n$ is the normal to the bounding surface, these being given for all
future times $t > 0$, by Kirchhoff’s formula:

$$\varphi(x, t) = \int \left[ g(x, t; x', t')\,\frac{\partial\varphi(x', t')}{\partial n'} - \frac{\partial g(x, t; x', t')}{\partial n'}\,\varphi(x', t') \right] d\sigma' = \int G(x, t; x', t')\,\varphi(x', t')\, d\sigma', \tag{29}$$


in which $g(x, t; x', t')$ is the Green’s function of (28) that satisfies the
inhomogeneous equation

$$\frac{\partial^2 g}{\partial t^2} - \nabla^2 g - \kappa^2 g = -\delta(t - t')\,\delta(x - x'), \tag{30}$$

and $G$ is the operator $g\,(\partial/\partial n') - \partial g/\partial n'$. The
surface σ′ over which the integration is carried out has $n'$ as its normal
and consists in part of the time sections S′ and S″ (Figure 1) and in part
also of the spatial section σ′ for which the initial data are given.
Formula (29) predicts the values within the volume bounded by the surfaces
σ′, σ″, S′, and S″.

Fig. 2. Initial data provided only for σ′ (a′b′), with boundary data unknown; the
region σ″, within which the field φ is known, contracts with the velocity of light c, and
σ″ = 0 at t = a′b′/2c.

Formula (29) shows directly that the field is expressed not only in terms
of the initial data (on surface σ′) but also via boundary values (on surfaces
S′ and S″), which we are obliged to specify for all future time. Then the
“if” that arises in mechanics acquires in field theory a direct mathematical
expression in terms of the integrals over the time surfaces S′ and S″. The
physical significance of these integrals is obvious: they describe the effects
of fields arising from regions of space not included in the initial data.
If we refuse to consider the boundary data, our reward is to be
restricted (as regards prediction of the field) to a region of space steadily
contracting with the passage of time, as shown in Figure 2. Unforeseen
signals penetrate into our region of space with the velocity of light, and
that region contracts at the same rate; in a time $t = R/c$ the entire region
is filled with waves not included in our calculation, and by this time our
prediction will have lost all validity.
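The contraction of the predictable region can be made concrete for the one-dimensional case. The following is a minimal sketch under simple assumptions, not a formula taken from the text: initial data are taken to be known on an interval of the x axis at $t = 0$, no boundary data are available, and unknown signals enter from both ends with speed $c$. The function names are hypothetical.

```python
def predictable_interval(left, right, c, t):
    """Part of [left, right] where the field is still fixed by the initial data at time t.

    Returns None once the region has closed completely.
    """
    lo = left + c * t    # front of unknown signals entering from the left end
    hi = right - c * t   # front of unknown signals entering from the right end
    return (lo, hi) if lo < hi else None

def prediction_lifetime(left, right, c):
    """Time after which no point is determined by the initial data alone."""
    return (right - left) / (2.0 * c)

if __name__ == "__main__":
    print(predictable_interval(0, 10, 1, 2))
    print(predictable_interval(0, 10, 1, 5))
    print(prediction_lifetime(0, 10, 1))
```

With the half-width of the interval written as $R$, the lifetime returned here is $R/c$, in agreement with the estimate quoted above.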

The linear character of the field allows us to write down directly the
cross-correlation of the fields at points $P'(x', t')$ and $P(x, t)$ if the
initial data and boundary values fluctuate. We have from (29) that

$$\overline{\delta\varphi(x', t')\,\delta\varphi(x, t)} = \iint G(P', P'')\, G(P, P''')\, D(P'' - P''')\, d\sigma''\, d\sigma''', \tag{31}$$

in which

$$D(P'' - P''') = \overline{\delta\varphi(P'')\,\delta\varphi(P''')} \tag{32}$$

is the correlation function for the fluctuations $\delta\varphi(P'')$ and
$\delta\varphi(P''')$ at points $P''(x'', t'')$ and $P'''(x''', t''')$ lying on
the boundary of the region of integration (on surfaces S′, S″, σ′). This
expression will not be considered for detailed cases, as it is closely
similar to result (22) for mechanics.
Consider the particular case in which the fluctuations are confined to
the initial data. There is then a more direct way of calculating the field
variance. We expand $\varphi(x, t)$ as a series in terms of the proper modes
$\psi_k(x)$:

$$\varphi(x, t) = \sum_k q_k(t)\,\psi_k(x). \tag{33}$$

The amplitudes $q_k(t)$ of the partial modes are determined by the
oscillator equation

$$\ddot{q}_k + \omega_k^2 q_k = 0, \tag{34}$$

in which $\omega_k$ is the frequency of mode $k$:

$$q_k = a_k \exp(i\omega_k t).$$

The cross-correlation function for points $P(x, t)$ and $P'(x', t')$ is

$$D(P - P') = \overline{\varphi(x, t)\,\varphi^*(x', t')} = \sum_k \sum_{k'} \psi_k^*(x')\,\psi_{k'}(x)\,\overline{a_k^* a_{k'}}\; e^{i(\omega_{k'} t - \omega_k t')}. \tag{35}$$

If the individual modes are statistically independent,

$$\overline{a_k^* a_{k'}} = \alpha(k)\,\delta(k - k') \tag{36}$$

and the expression becomes

$$D(P - P') = \sum_k \psi_k^*(x')\,\psi_k(x)\,\alpha(k)\, e^{i\omega_k (t - t')}, \tag{37}$$

and if $\alpha(k)$ is constant (if the mean of $a_k^* a_k$ is the same for all
$k$), we have instead of (37) that

$$D(P - P') = \alpha \sum_k \psi_k^*(x')\,\psi_k(x)\, e^{i\omega_k (t - t')}.$$

This expression coincides with the Green’s function for (28), and for
$t = t'$ we obtain that

$$D(P - P') = \alpha\,\delta(x' - x). \tag{38}$$
The variance (dispersion) given by (37) does not increase with time. This
is a distinctive feature of linear systems.

There are some major difficulties in the discussion of nonlinear systems, 
but we can obtain some idea of the behaviour of a nonlinear field if we 
bear in mind that the latter may be considered as an infinitely large set 
of coupled oscillators. As a system containing a large but finite number 
of coupled oscillators is quasiperiodic, the variance will increase with 
time if the initial data are not completely defined. If this conclusion may 
be transferred to an infinite set, we would expect the variance to increase 
for a nonlinear field. 


E. CONCLUSIONS 


We have seen that the basic assumption of classical mechanics (that it is 
possible to determine uniquely the future state of a system from its initial 
data) is based on an abstraction that excludes all randomness. The effects 
of randomness cannot be overlooked in the general case, since the un- 
certainty arising from the initial data increases with time, and the pre- 
diction becomes quite meaningless after the lapse of a certain finite time. 

In fact, it is common knowledge that the input data must from time 
to time be corrected even in a science as precise as celestial mechanics, 
in order to eliminate cumulative errors. 

No machine can be left to operate for an indefinite time without 
touching the controls to remove errors accumulating during the operation. 

The supporters of inflexible determinism are wrong in taking classical 
mechanics as their example: the motion predicted by classical mechanics 
is unstable either on account of small random deviations in the initial 
data or owing to the presence of random forces. No matter how small 
these perturbing effects may be, there always comes a time at which their 
consequences will predominate. This instability against slight randomness 
completely destroys the illusion of unambiguous prediction of the future 
from the initial data without subsequent correction during the process. 
CHAPTER III


A GIBBS ENSEMBLE 


Experiment shows that a system subject to random factors may show 
well-defined laws when the observations are repeated many times. Such 
laws are usually termed statistical. Random factors are always present in 
any real case, but their magnitude depends very much on the nature of 
the system. A system with numerous interacting degrees of freedom may 
be dominated in behaviour by the random factors; then a dynamic law 
becomes the slave of chance (a deterministic law is often termed
“dynamic” in mechanics), but chance itself produces a new type of law, the
statistical regularity. 

An example of this occurs in the Maxwell-Boltzmann kinetic theory of 
matter. Major gifts from His Majesty Chance are the Maxwell distri- 
bution and temperature, which relate molecular processes to macroscopic 
ones. In his H-theorem Boltzmann attempted to show that Chance, 
strictly speaking, cannot do other than give rise to the Maxwell distri- 
bution. 

There are no methods for “deducing” statistical laws from deterministic
ones; at best they can be made compatible. Any system where
chance plays a major part always requires the use of special assumptions
of the probability-theory type in order to derive its laws, e.g. the
assumption of elementary disorder or some other assumption of equal
probability for states of a dynamic system.

We shall not enter deeply into these topics, by which learned men have 
long been confused, and shall assume de facto that chance can give rise 
to a law no less than can determinism. 

Gibbs, the founder of molecular statistics, was the first to perceive that 
we are not obliged to seek the ways in which chance brings a mechanical 
system to a state that is definite in the statistical sense; we may make 
some general assumptions and later compare them with experiment. 

Gibbs introduced the idea of an ensemble of systems, which has proved
of extreme importance throughout statistical physics (ref. 1). Consider a
large number of identical and completely mutually isolated systems, each of
which interacts weakly with a large macroscopic system at a set temperature
(thermostat). This set of systems is known as an ensemble; Figure 3
shows such an ensemble in the form of a row of thermostats M receding
to infinity, together with the systems μ interacting with them. In fact, it
is not essential to have numerous thermostats M; one will suffice, provided
that the systems that interact with it thereby exert no mutual influence
nor alter the macroscopic state of the thermostat.

Fig. 3. A Gibbs ensemble in the form of an infinite sequence of identical
microsystems μ interacting weakly with thermostats all at the same temperature θ.

We now measure some of the dynamic variables (e.g. the momenta $p$
and coordinates $q$) on a large number of the systems constituting this
ensemble. Gibbs asserted that there is a completely definite probability
for any given result from this experiment, i.e. that we have the canonical
distribution

$$W_\theta(p, q)\, dp\, dq = \exp\left[\frac{\psi(\theta) - \varepsilon(p, q)}{\theta}\right] \frac{dp\, dq}{2\pi\hbar}, \tag{1}$$

in which $W_\theta(p, q)\, dp\, dq$ is the probability of finding that the
momentum lies in the range $p$ to $p + dp$ and that the coordinate lies in
the range $q$ to $q + dq$ (all formulae are explicitly for one degree of
freedom, for simplicity); $\theta$ is the temperature of the thermostat,
$\varepsilon(p, q)$ is the energy of the system, and $\exp[\psi(\theta)/\theta]$
represents a normalization factor.

The existence of this distribution is a hypothesis; on this point Gibbs
(ref. 1) stated that the distribution represented by (1) is the simplest
conceivable case, because it has the feature that the law of distribution
between phases for the parts is the same for all when the system consists
of parts with distinct energies, this feature very greatly simplifying the
study and forming the basis for extremely important relationships in
thermodynamics. This view may be supported by the following argument
(ref. 2). Liouville’s theorem indicates that the density $\rho(p, q)$ of
points in phase space is constant along a trajectory, which means that it
must be a function of the integrals of motion alone. We now assume that our
system can be divided into two weakly interacting parts A and B; then
$\rho$ is the product of the densities for subsystems A and B:

$$\rho_{A+B} = \rho_A \cdot \rho_B. \tag{2}$$

This means that $\ln\rho$ is additive, so that $\rho$ can be a function of
the additive integrals of motion alone. If we disregard macroscopic
movements of the system as a whole (rotation, translational motion), the
only additive integral of motion is the energy $\varepsilon$ of the system.
Solution of the functional relation (2) now gives

$$\rho = e^{\alpha + \beta\varepsilon}. \tag{3}$$

Bearing in mind that only few systems will have infinitely large energy,
we conclude that $\beta < 0$; let $\beta = -1/\theta$. Also, $\alpha$ is defined
by the normalization condition

$$\int \rho(p, q)\,\frac{dp\, dq}{2\pi\hbar} = 1. \tag{4}$$

Hence $\alpha = \psi(\theta)/\theta$. Next we assume that one of the systems
is very large, so that it may be considered as a Gibbs thermostat;
comparison with (1) then shows that $\theta$ is the temperature
characterizing the ensemble in equilibrium.
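The way in which the canonical distribution ties the microscopic variables to the temperature of the thermostat can be illustrated by sampling. The sketch below is an illustration under assumptions not made in the text: the system is taken to be a one-dimensional harmonic oscillator with $\varepsilon = p^2/2m + m\omega^2 q^2/2$, for which $\exp(-\varepsilon/\theta)$ factorizes into independent Gaussian distributions of $p$ and $q$, and $\theta$ is measured in energy units. The ensemble average of the energy should then come out equal to $\theta$. Names, sample size, and seed are arbitrary.

```python
import math
import random

def mean_energy(theta, m=1.0, omega=1.0, n=100_000, seed=1):
    """Ensemble average of the energy for an oscillator drawn from exp(-energy/theta)."""
    rng = random.Random(seed)
    sigma_p = math.sqrt(m * theta)                  # exp(-p^2 / (2 m theta))
    sigma_q = math.sqrt(theta / (m * omega ** 2))   # exp(-m omega^2 q^2 / (2 theta))
    total = 0.0
    for _ in range(n):
        p = rng.gauss(0.0, sigma_p)
        q = rng.gauss(0.0, sigma_q)
        total += p * p / (2.0 * m) + 0.5 * m * omega ** 2 * q * q
    return total / n

if __name__ == "__main__":
    for theta in (0.5, 1.0, 2.0):
        print(theta, mean_energy(theta))
```

No single member of the ensemble has a definite relation to $\theta$; only the distribution over many members does, which is the point made in the text about the objective significance of the ensemble.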

An attempt is sometimes made to derive the Gibbs distribution by
considering microcanonical ensembles, which involves fresh hypotheses.
To avoid the tangled question of axiomatics, we shall consider only the
original approach derived from Gibbs, because this is sufficient for our
purpose. An exceptionally interesting feature of $W_\theta(p, q)$ is that it
relates the parameters of the macroscopic environment M (i.e. the
temperature $\theta$ of the thermostat) to those of the microsystems
(momenta $p$ and coordinates $q$). We may say that $W_\theta(p, q)$ relates
to a microsystem in a specified macroscopic environment. A Gibbs ensemble
is merely the result of bringing together a large number of such
situations; in many cases it is practically feasible to a high degree of
approximation.

For the sake of future requirements it should be pointed out here that
no great significance is attached to the question whether
$W_\theta(p, q)$ relates to a single particle or is a characteristic of many
particles (a question posed largely by the disputes of believers). The
essential point is that, whether or not the observation is made, a single
measurement alters only our subjective relation to the observed fact: if a
typical phenomenon occurs it is reasonable to say “as expected”, while if a
rare event occurs we may merely express our surprise or pleasure, as at a
good win in a lottery. All of this concerns subjective evaluations; the only
thing of objective significance is the distribution of the results of
measurements, and this is derived from the performance of numerous
measurements on the ensemble, this being the distribution predicted by the
probability $W_\theta(p, q)$.

The existence of this probability is one of the striking and fundamental 
laws of statistical molecular physics. 

Here once more we are indebted to the game of chance, which conflicts
with its own intrinsic nature in giving rise to laws as simple as the
canonical Gibbs distribution.
CHAPTER IV


A QUANTUM ENSEMBLE 


Consider the case of a Gibbs ensemble whose thermostat is at zero
temperature. The canonical distribution implies that the energy of the
system must be as low as possible, which means that the kinetic energy
$T(p)$ must be zero, while the potential energy $U(q)$ must be at the
minimum. This further implies that the momenta must be zero, and the
coordinates are obliged to have the one single value corresponding to the
minimum potential energy. In other words, $\theta = 0$ implies that each
dynamic variable is confined to a single value, with no statistical spread
at all. In particular, there should be no light scattering due to molecular
thermal motion. However, experiment shows that such scattering does
occur at low temperatures and in the limit (at absolute zero). This shows
that motion persists in some form at this temperature, where all motion
should have ceased. James, Brindley, and Wood were among the first to
examine this, and in their paper¹ on the scattering of X-rays in an
aluminium crystal they stated that it is necessary to assume a zero-point
energy (i.e. an energy of motion at absolute zero) in order to obtain
agreement with experiment.

This unfreezable motion is a motion of a new (quantum) type. A Gibbs
ensemble spontaneously becomes a quantum ensemble as absolute zero
is approached. The problem here is then whether any new laws arise in
consequence, and if so, what is their form. It turns out that there exists
a full symphony of new statistical laws governing the motion of
microparticles when macroscopic bodies, which determine the conditions
of motion for microparticles, are at absolute zero. These new laws
constitute quantum mechanics, which deals with the motion of microparticles
in a quantum ensemble.

A very important characteristic feature of a quantum ensemble is that
there is a fundamental relation between the variance
$\overline{\Delta x^2} = \overline{(x - \bar{x})^2}$ of the coordinate $x$ (in which
the bar denotes an average over the ensemble, $\bar{x}$ being the mean of $x$
for the particle or microsystem) and the variance
$\overline{\Delta p^2} = \overline{(p - \bar{p})^2}$ of the conjugate momentum $p$.
This is known as Heisenberg's uncertainty principle

$$\overline{\Delta p^2}\cdot\overline{\Delta x^2} \ge \frac{\hbar^2}{4}, \tag{1}$$

in which $\hbar$ is Planck's constant.²,³
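Relation (1) can be checked numerically for a simple case. The sketch below is only an illustration and is not taken from the text: it assumes a real Gaussian wave packet $\psi(x) \propto \exp(-x^2/4\sigma^2)$, units with $\hbar = 1$, and a plain Riemann sum on a finite grid (the names `variances`, `HBAR`, `span` and `n` are hypothetical). For such a packet the product of the two variances should come out close to the lower bound $\hbar^2/4$.

```python
import math

HBAR = 1.0  # assumption: units in which Planck's constant hbar equals 1


def variances(sigma, n=4001, span=12.0):
    """Return (var_x, var_p) for the real packet psi(x) ~ exp(-x^2 / (4 sigma^2))."""
    half = span * sigma
    dx = 2.0 * half / (n - 1)
    xs = [-half + i * dx for i in range(n)]
    psi = [math.exp(-x * x / (4.0 * sigma * sigma)) for x in xs]
    norm = sum(a * a for a in psi) * dx
    mean_x = sum(x * a * a for x, a in zip(xs, psi)) * dx / norm
    var_x = sum((x - mean_x) ** 2 * a * a for x, a in zip(xs, psi)) * dx / norm
    # For a real psi the mean momentum vanishes, so
    # var_p = hbar^2 * integral |dpsi/dx|^2 dx / norm (central differences).
    dpsi = [(psi[i + 1] - psi[i - 1]) / (2.0 * dx) for i in range(1, n - 1)]
    var_p = HBAR ** 2 * sum(d * d for d in dpsi) * dx / norm
    return var_x, var_p


for sigma in (0.5, 1.0, 2.0):
    var_x, var_p = variances(sigma)
    # The product should be close to hbar^2 / 4 whatever the width sigma.
    print(sigma, var_x * var_p, HBAR ** 2 / 4.0)
```

Narrowing the packet in $x$ (smaller `sigma`) widens it in $p$ by the same factor, which is the trade-off that (1) expresses.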

So far the arguments have been general. Let us now turn to a more
rigorous definition of a quantum ensemble. Consider a set of macroscopic
bodies, which can be termed the macrosetting $\mathcal{M}$. This macro
environment in some way determines the state of motion of the microsystem $\mu$
(sometimes simply the word "state" will be used, the word "motion"
being omitted). We now imagine $\mathcal{M}$, with its $\mu$, reproduced an
indefinitely large number of times, as for the thermostats and microsystems in a
Gibbs ensemble.

If in this totality of systems

$$\overline{\Delta p^2}\cdot\overline{\Delta x^2} \approx \frac{\hbar^2}{4}, \tag{2}$$

we call the set a quantum ensemble.²,³,⁴

Figure 4 illustrates a quantum ensemble as an indefinitely receding row
of macroscopic bodies $\mathcal{M}$ and microsystems $\mu$.

The macroscopic setting is constituted not only by macroscopic bodies 
but also by macroscopic fields, and it includes the conditions of generation 
of the particles (particle sources). 

The more exact definition of a quantum ensemble contains no
assumption about absolute-zero temperature, by which previously we
wished to stress that the thermal motion of atoms or molecules has no
effect on the statistics of quantum effects. In fact, quantum effects are
also observed when the temperature of the bodies is above
zero. Here we speak of quantum statistics, which acts as a correction to
the Gibbs distribution at low temperatures. More precisely, we may say
that we are concerned with quantum mechanics if (2) is obeyed and if
the kinetic energy of the particles $T(p) = p^2/2m \gg \theta$, with $m$ being the mass
of a particle, whereas we are concerned with quantum statistics if $T(p) \approx \theta$.
We now must examine whether we can describe a quantum ensemble via
a probability of the type constituting the basic law of a Gibbs ensemble.
A priori there would appear to be no reason why this should not be
possible.

In fact, this is not the case. Relation (1) states that this possibility is
ruled out, for it indicates that there is no quantum ensemble for which
$\overline{\Delta q^2}$ and $\overline{\Delta p^2}$ are simultaneously zero; if there
were a probability $W(p, q)$ that would predict the possibility of finding a
microsystem with $q$ as its coordinate and $p$ as its momentum, then it would be
possible to state a method of selecting such systems and hence to construct from
these a new ensemble having $\overline{\Delta p^2} = 0$ and $\overline{\Delta q^2} = 0$.
Such an ensemble would conflict with the uncertainty principle.

Fig. 4. A quantum ensemble: an infinite sequence of identical microsystems $\mu$, each
in the same macroscopic setting $\mathcal{M}$ (e.g. an accelerator operating in a given
mode, a magnetic analyser, collimating slit, etc.).

We must thus conclude that there is no such probability $W(p, q)$, and
we must examine what replaces it for quantum ensembles, how the latter
may be characterized, and how the probability of different results of
observation may be calculated.

The problem may be dealt with by formulating Heisenberg’s relation 
in somewhat more general form, via Bohr’s principle of complementarity, 
which can be put as follows: 

The dynamic variables of a microsystem may be divided into two 
mutually complementary groups: space-time ones and momentum-energy 
ones. There is no ensemble in which both groups of dynamic variables 
could have definite values. 

This is a direct generalization of Heisenberg’s relation. Bohr himself 
formulated the principle in a somewhat different way, reflecting his philo- 
sophical concepts which were far from those of materialism. His formu- 
lation has been the origin of the far-reaching conclusion that the current 
mechanics of the atom cannot be compatible with materialism. We shall 
not deal in any further detail with this aspect of the question, but there 
is an extensive literature⁵,⁶,⁷ presenting various points of view.

It would seem generally better to speak of a principle of exclusiveness 
rather than complementarity: dynamic variables should be divided into 
mutually exclusive groups, which do not coexist in real ensembles. How- 
ever, out of respect for Bohr and his tradition we shall retain the usual 
terminology. 

Complementarity rules out the description of microsystems via a phase
space $R(p, q)$, since this contains the complementary variables $p$ and $q$;
however, the principle does not forbid us to use either configuration space
$R(q)$ or momentum space $R(p)$, as each of these contains variables that
are either only spatial or only momentum ones. A system with $f$ degrees of
freedom is described by $f$ variables either in $R(q)$ or in $R(p)$, in contrast
to classical mechanics. The variables p and q form a complete set of
variables.

More precisely, we define the complete set of dynamic variables
sufficient for an exhaustive description of a quantum system with $f$ degrees
of freedom as a set of $f$ simultaneously measurable independent variables:
$q_1, q_2, \ldots, q_f$ (spatial, set of $q$) or $p_1, p_2, \ldots, p_f$ (momentum,
set of $p$). Such a set of dynamic variables contains the maximum information on
the system compatible with the laws that govern the microworld.

A quantum ensemble therefore has the probability

$$W_{\mathcal{M}}(q)\,dq \tag{3}$$

that the specific value $q$ will be found in a measurement. The subscript
$\mathcal{M}$ has the same meaning as the subscript $\theta$ (temperature) in Equation
(III-1) for a classical Gibbs ensemble; it indicates the macroscopic setting
($\mathcal{M}$) that governs the conditions of motion for the microsystem ($\mu$) and
hence determines its state.

A further question is the probability that a microsystem in the same
ensemble will have some particular value in the complementary ($p$) set.
This probability may be put as

$$W_{\mathcal{M}}(p)\,dp. \tag{4}$$


The number of probabilities of the type of (3) or (4) is not necessarily 
small; in principle, the number may be as large as the number of possible 
distinct complete sets (q) or conjugate sets (p). 

The number for any particular real system is restricted by the fact that 
many of the sets are not convenient for description of the system, but 
this is a purely practical consideration, so the number of probabilities 
such as (3) or (4) may be indefinitely large. The set of these characterizes 
completely the state of a microsystem in a quantum ensemble, since this
set essentially exhausts the predictions of all possible measurements on a
microsystem in this ensemble ($\mathcal{M} + \mu$).

This raises the problem of counting the probabilities, including those 
unsuitable for practical purposes. If a way of calculating the number can 
be found, we will have a method of describing the state of a microsystem 
in a quantum ensemble. 

The answer given by quantum mechanics to this question is quite
unexpected (from the viewpoint of the classical theory): there is a quantity
characteristic of the ensemble $\mathcal{M} + \mu$ that characterizes the quantum
ensemble completely in the sense that, when the quantity is known, one can
calculate all possible probabilities of the type $W_{\mathcal{M}}(q)$ or
$W_{\mathcal{M}}(p)$. This quantity is the wave function:

$$\psi_{\mathcal{M}} = \psi_{\mathcal{M}}(q). \tag{5}$$

This $\psi_{\mathcal{M}}$ is written explicitly as a function of the coordinates $q$,
the relation of the wave function to the probability density in this case being
given by

$$W_{\mathcal{M}}(q) = |\psi_{\mathcal{M}}(q)|^2. \tag{6}$$


The above formula is completely general in the following sense: the
probability density for the value $p$ in the set of variables ($p$) is as follows:

$$W_{\mathcal{M}}(p) = |\psi_{\mathcal{M}}(p)|^2. \tag{6'}$$

Here the wave function $\psi_{\mathcal{M}}$ is given as a function of the variables of
some other complete set, for example $\psi_{\mathcal{M}}(p)$. More generally, for a
complete set of variables ($a$) we have

$$W_{\mathcal{M}}(a) = |\psi_{\mathcal{M}}(a)|^2, \tag{7}$$

and for a set ($b$)

$$W_{\mathcal{M}}(b) = |\psi_{\mathcal{M}}(b)|^2, \tag{7'}$$

etc. All these functions $\psi_{\mathcal{M}}(q)$, $\psi_{\mathcal{M}}(p)$,
$\psi_{\mathcal{M}}(a)$, $\psi_{\mathcal{M}}(b)$, and so on describe
the same quantum ensemble, which is characterized by a macrosetting $\mathcal{M}$
and a microsystem $\mu$.

This feature is stressed by saying that the wave function is given in the 
q-representation or in the p-representation, in the a-representation, and 
so on. 

The mathematical equivalence of the wave functions in the various
representations, as regards description of the quantum ensemble, is seen
from the fact that $\psi_{\mathcal{M}}$ may be considered as a vector in Hilbert space,
while the representations may be considered as representations of the
vector in different systems of axes (orthogonal unit vectors), a linear
combination of these allowing one to represent any other vector.? 

In this formulation, rotation in Hilbert space represents a transition
from one representation of the wave function to another. This rotation
is performed via a unitary matrix $S$, so that

$$\psi_{\mathcal{M}}(b) = S\,\psi_{\mathcal{M}}(a), \tag{8}$$

with

$$S S^{+} = 1, \tag{9}$$

in which $S^{+}$ is the matrix conjugate with $S$. Relations (8) and (9) in
explicit form state that

$$\psi_{\mathcal{M}}(b) = \int S(b|a)\,\psi_{\mathcal{M}}(a)\,da, \tag{8'}$$

$$\int S(b|c)\,S^{+}(c|a)\,dc = \delta(b - a), \tag{9'}$$


in which the integrals in (8’) and (9’) are to be understood in the gener- 
alized sense, as being suitable for cases where the spectra of possible 
values for a and b may have discontinuities or may even consist of 
separate points (discrete spectra). In the latter case the integral reduces 
to a sum. The matrices S are derived from linear equations (see textbooks 
on quantum mechanics, e.g.‘). 
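For a discrete spectrum the integrals in (8') and (9') become sums, and the change of representation can be tried out directly. The following sketch is an illustration under stated assumptions, not the author's construction: the unitary matrix is taken to be the normalized discrete Fourier matrix on `n` points, and the names `unitary_dft`, `transform` and `unitarity_defect` are hypothetical. It checks the discrete form of (9') and shows that the total probability is the same in both representations.

```python
import cmath
import math


def unitary_dft(n):
    """S(b|a) = exp(-2*pi*i*a*b/n) / sqrt(n), a unitary n x n matrix."""
    return [[cmath.exp(-2j * math.pi * a * b / n) / math.sqrt(n)
             for a in range(n)] for b in range(n)]


def transform(S, psi_a):
    """Discrete form of (8'): psi(b) = sum_a S(b|a) psi(a)."""
    return [sum(S[b][a] * psi_a[a] for a in range(len(psi_a)))
            for b in range(len(S))]


def unitarity_defect(S):
    """Largest deviation of sum_c S(b|c) S^+(c|a) from delta_ba, cf. (9')."""
    n = len(S)
    worst = 0.0
    for b in range(n):
        for a in range(n):
            # S^+(c|a) is the complex conjugate of S(a|c).
            acc = sum(S[b][c] * S[a][c].conjugate() for c in range(n))
            worst = max(worst, abs(acc - (1.0 if a == b else 0.0)))
    return worst


n = 8
raw = [complex(k + 1, (-1) ** k) for k in range(n)]   # arbitrary illustrative amplitudes
scale = math.sqrt(sum(abs(z) ** 2 for z in raw))
psi_a = [z / scale for z in raw]           # normalized, a-representation
psi_b = transform(unitary_dft(n), psi_a)   # same state, b-representation

print(unitarity_defect(unitary_dft(n)))
print(sum(abs(z) ** 2 for z in psi_a), sum(abs(z) ** 2 for z in psi_b))
```

The individual probabilities $|\psi(a)|^2$ and $|\psi(b)|^2$ differ, but both sets sum to one, which is what is meant by saying that the two functions describe the same ensemble.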

The transform of (8) allows us to consider the wave function as an
objective characteristic of a quantum ensemble, but one which may be
given in various representations. This important feature forms the basis
of the concept of a quantum ensemble. In this concept the wave function
$\psi_{\mathcal{M}}$ is considered as the equivalent, in quantum theory, of the classical
probability $W_\theta(p, q)$ of a state for the system in phase space $R(p, q)$.
A knowledge of $W_\theta(p, q)$ enables us to determine the probability of any
other set ($P$, $Q$) of dynamic variables in phase space; similarly, a
knowledge of $\psi_{\mathcal{M}}$ for a quantum ensemble enables us to determine the
probability of any complete set ($a$) of dynamic variables. Hence the wave
function is not the quantity that determines the statistics of any particular
measurement but one that determines the statistics of a quantum ensemble,
i.e. the statistics of any measurement compatible with the nature of the
microsystem $\mu$ and of the macroscopic setting $\mathcal{M}$ that governs the
conditions of motion for $\mu$.

CHAPTER V


THE DENSITY MATRIX 


In certain cases the macroscopic setting is not sufficiently defined and 
must itself be given by a statistical description. This naturally introduces 
a further uncertainty into the quantum ensemble, which is then called 
mixed, to distinguish it from the pure ensemble that occurs when the 
macroscopic setting is completely defined (see Chapter IV and also refs. 
2 and 4). 

A very simple example of this situation occurs when there are several
incoherent sources of particles, which may themselves differ from one
another in momentum, polarization, or other parameters. Let the first
source generate a particle with a probability $P_{\mathcal{M}_1}$ and the second one
with a probability $P_{\mathcal{M}_2}$ ($\mathcal{M}_1$ and $\mathcal{M}_2$ denote the
macroscopic settings of the sources). The first source could produce a pure
quantum ensemble described by the wave function $\psi_{\mathcal{M}_1}(q)$, the second
similarly producing one described by $\psi_{\mathcal{M}_2}(q)$. We thus have a quantum
ensemble that must be described via a set of probabilities

$$P_{\mathcal{M}_1} \quad\text{and}\quad P_{\mathcal{M}_2}$$

and a set of wave functions

$$\psi_{\mathcal{M}_1}(q) \quad\text{and}\quad \psi_{\mathcal{M}_2}(q).$$

With $P_{\mathcal{M}_1} + P_{\mathcal{M}_2} = 1$, we have $P_{\mathcal{M}_1}$ and
$P_{\mathcal{M}_2}$ as statements of the proportions in which the pure ensembles are
mixed, these ensembles being described by $\psi_{\mathcal{M}_1}$ and
$\psi_{\mathcal{M}_2}$ in our mixed ensemble
($\mathcal{M} = \mathcal{M}_1 + \mathcal{M}_2$). It is clear that in the general case we
have any set of probabilities (including an indefinitely large one):

$$P_{\mathcal{M}_1}, P_{\mathcal{M}_2}, \ldots, P_{\mathcal{M}_s}, \ldots \tag{1}$$

and the corresponding wave functions

$$\psi_{\mathcal{M}_1}, \psi_{\mathcal{M}_2}, \ldots, \psi_{\mathcal{M}_s}, \ldots \tag{2}$$

This description via the two series of $P_{\mathcal{M}_s}$ and $\psi_{\mathcal{M}_s}$
is very inconvenient and, at first sight, seems to take us far from the analogy
with the classical Gibbs ensemble. However, the analogy is restored if we replace
the wave function in the description of a mixed quantum ensemble by a quadratic
form derived from the wave function, which is termed the density matrix:

$$\rho_{\mathcal{M}}(q, q') = \sum_s P_{\mathcal{M}_s}\,\psi^*_{\mathcal{M}_s}(q')\,\psi_{\mathcal{M}_s}(q), \tag{3}$$

in which $q$ and $q'$ denote two different points in the space of some
complete set of dynamic variables (configuration or momentum ones).
The $q$ points correspond to the rows of the matrix, and the $q'$
points to the columns. A diagonal element ($q = q'$) represents the
probability of finding the value $q$ for the dynamic variables in a mixed quantum
ensemble. In fact, in this case we get the usual formula for combining
the probabilities of independent events:

$$W_{\mathcal{M}}(q) = \rho_{\mathcal{M}}(q, q) = \sum_s P_{\mathcal{M}_s}\,|\psi_{\mathcal{M}_s}(q)|^2. \tag{4}$$

For the particular case of $P_{\mathcal{M}_s} = 1$, all other $P$ are zero, and we
return to the initial formula for a pure ensemble:

$$W_{\mathcal{M}}(q) = |\psi_{\mathcal{M}}(q)|^2. \tag{4'}$$
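The statement that the diagonal of the density matrix reproduces the mixture rule (4) is easy to verify on a small discrete grid. The sketch below is illustrative only: the three-point grid, the two wave functions and the weights are made-up values, and `density_matrix` is a hypothetical helper that implements $\rho(q, q') = \sum_s P_s\,\psi_s(q)\,\psi_s^*(q')$.

```python
def density_matrix(weights, states):
    """rho(q, q') = sum_s P_s * psi_s(q) * conj(psi_s(q')) on a discrete q grid."""
    n = len(states[0])
    return [[sum(P * psi[q] * psi[qp].conjugate()
                 for P, psi in zip(weights, states))
             for qp in range(n)] for q in range(n)]


# Two normalized pure states on a three-point grid (illustrative values).
psi_1 = [1.0 + 0j, 0j, 0j]
psi_2 = [0.6 + 0j, 0.8j, 0j]
states = [psi_1, psi_2]
weights = [0.25, 0.75]   # mixing proportions, summing to 1

rho = density_matrix(weights, states)

# Diagonal elements rho(q, q) against the weighted sum of |psi_s(q)|^2.
diagonal = [rho[q][q].real for q in range(3)]
mixture = [sum(P * abs(psi[q]) ** 2 for P, psi in zip(weights, states))
           for q in range(3)]
print(diagonal)
print(mixture)
```

The off-diagonal elements carry the phase relations inside each pure component; they drop out of (4) but matter for any quantity that is not diagonal in $q$.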


The elements of this density matrix satisfy certain symmetry conditions.
$\psi_{\mathcal{M}}(q)$ and its conjugate $\psi^*_{\mathcal{M}}(q)$ describe the same
state; (3) implies that $\rho(q, q')$ is a Hermitian matrix:

$$\rho^{+}_{\mathcal{M}}(q, q') = \rho^{*}_{\mathcal{M}}(q', q) = \rho_{\mathcal{M}}(q, q'). \tag{5}$$

Further, if the microsystem itself consists of identical particles (or does
so at least partly), then $\psi_{\mathcal{M}}(q)$ either remains unchanged when identical
particles $i$ and $k$ are interchanged (particles obeying Bose-Einstein
statistics) or reverses its sign (particles obeying Fermi statistics). Let $P_q$ be
the operator denoting interchange of the dynamic variables $q_i$ and $q_k$ of
particles $i$ and $k$; we then have for the wave function

$$P_q\,\psi_{\mathcal{M}}(q) = \pm\,\psi_{\mathcal{M}}(q). \tag{6}$$

The density matrices are then

$$P_q\,\rho_{\mathcal{M}}(q, q') = \pm\,\rho_{\mathcal{M}}(q, q'), \tag{7}$$

$$P_{q'}\,\rho_{\mathcal{M}}(q, q') = \pm\,\rho_{\mathcal{M}}(q, q'), \tag{7'}$$

$$P_{qq'}\,\rho_{\mathcal{M}}(q, q') = +\,\rho_{\mathcal{M}}(q, q'). \tag{8}$$

The last relation indicates that the matrix is symmetric with respect to
particle interchange ($P_{qq'} = P_q P_{q'}$ in operators).

Let some function of the dynamic variables be represented by the
operator $\mathcal{L}$, whose matrix elements in the $q$ representation are $L(q', q)$;
quantum mechanics shows that the mean value $\bar{L}$ of the physical quantity
represented by $\mathcal{L}$ is

$$\bar{L} = \int \psi^*_{\mathcal{M}}(q')\,L(q', q)\,\psi_{\mathcal{M}}(q)\,dq'\,dq. \tag{9}$$

Then (3) shows that this mean in a mixed ensemble described by the
matrix of (5) is

$$\bar{L} = \int \rho_{\mathcal{M}}(q, q')\,L(q', q)\,dq'\,dq = \mathrm{Sp}(\rho_{\mathcal{M}}\mathcal{L}), \tag{10}$$

in which Sp denotes the spur (trace) of the matrix ($\rho_{\mathcal{M}}\mathcal{L}$).
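The equality between the trace formula (10) and the weighted sum of pure-state means (9) can be illustrated on a two-dimensional example. This is a sketch under assumptions, not part of the original text: the operator matrix `L`, the two states and the weights are arbitrary illustrative choices, and the helper names are hypothetical.

```python
import math


def density_matrix(weights, states):
    """rho(q, q') = sum_s P_s * psi_s(q) * conj(psi_s(q'))."""
    n = len(states[0])
    return [[sum(P * psi[q] * psi[qp].conjugate()
                 for P, psi in zip(weights, states))
             for qp in range(n)] for q in range(n)]


def trace_of_product(A, B):
    """Sp(A B) = sum over q, q' of A(q, q') B(q', q)."""
    n = len(A)
    return sum(A[q][qp] * B[qp][q] for q in range(n) for qp in range(n))


def pure_mean(psi, L):
    """Discrete form of (9): sum over q', q of conj(psi(q')) L(q', q) psi(q)."""
    n = len(psi)
    return sum(psi[qp].conjugate() * L[qp][q] * psi[q]
               for qp in range(n) for q in range(n))


# Illustrative hermitian operator and two normalized states.
L = [[1.0 + 0j, 2.0 - 1.0j],
     [2.0 + 1.0j, -3.0 + 0j]]
r = 1.0 / math.sqrt(2.0)
states = [[1.0 + 0j, 0j], [r + 0j, r * 1j]]
weights = [0.3, 0.7]

mean_from_trace = trace_of_product(density_matrix(weights, states), L)
mean_from_states = sum(P * pure_mean(psi, L) for P, psi in zip(weights, states))
print(mean_from_trace, mean_from_states)
```

Both routes give the same real number, so the density matrix alone is enough to compute any mean value in the mixed ensemble.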

The density matrix may also be given in a mixed representation, the
rows denoting the values of one set of dynamic variables ($q$) and the
columns those of another set ($p$):

$$\rho_{\mathcal{M}}(q, p) = \sum_s P_{\mathcal{M}_s}\,\psi_{\mathcal{M}_s}(q)\,\psi^*_{\mathcal{M}_s}(p), \tag{11}$$

where, in accordance with the general rules for transforming the
wave function from one set of variables to another, we have:

$$\psi_{\mathcal{M}_s}(p) = \int S(p, q)\,\psi_{\mathcal{M}_s}(q)\,dq, \tag{12}$$

in which $S$ is the matrix for the unitary transformation from the $q$
variables to the $p$ variables.

If by the $q$ variables we understand the coordinates and by the $p$ ones
the momenta, then the density matrix becomes the direct analogue of the
classical density in phase space, $W(q, p)$. However, there is¹ a more
direct relation between the classical density in phase space and the
quantity

$$R_{\mathcal{M}}(q, p) = \rho_{\mathcal{M}}(q, p)\,S^{-1}(q, p), \tag{13}$$

in which $S^{-1}(q, p)$ is an element of the matrix inverse to $S(q, p)$. The
matrix $S$ has the following elements when the coordinates $q$ and momenta
$p$ are cartesian:

$$S(q, p) = \frac{\exp(ipq/\hbar)}{\sqrt{2\pi\hbar}}, \tag{14}$$

and all the relationships become particularly simple. It is readily shown
from (11), (13) and (14) that

$$R_{\mathcal{M}}(q) = \int R_{\mathcal{M}}(q, p)\,\frac{dp}{2\pi\hbar} \tag{15}$$

is the probability of finding a value $q$ for the coordinates in the ensemble,
while

$$R_{\mathcal{M}}(p) = \int R_{\mathcal{M}}(q, p)\,\frac{dq}{2\pi\hbar} \tag{16}$$

is the probability of finding momenta equal to $p$. Formulae (15) and (16)
coincide exactly with those of classical theory if $R_{\mathcal{M}}(q, p)$ is considered
as $R(q, p)$, the density in phase space. Finally, we have the formula for
the mean $\bar{L}$ of the physical quantity represented by the operator $\mathcal{L}$ whose
matrix elements are $L(q, p)$ in the $q$, $p$ representation:

$$\bar{L} = \int R^*_{\mathcal{M}}(q, p)\,L(q, p)\,\frac{dq\,dp}{2\pi\hbar}, \tag{17}$$

in which $R^*_{\mathcal{M}}(q, p)$ is the complex conjugate of matrix
$R_{\mathcal{M}}(q, p)$, while

$$L(q, p) = \int L(q, q')\,S(q', p)\,dq' \cdot S^{-1}(q, p). \tag{18}$$


Formulae (15)-(17) are so closely analogous to the corresponding
formulae of classical mechanics that one is tempted to base the
description of a quantum ensemble on the density matrix $R(q, p)$ rather
than on the wave function; however, a more detailed examination shows
that this temptation should be resisted. This aspect will be discussed in
more detail later; here we merely note that rather cumbrous conditions
replace the simple ones for the wave function and the matrix $\rho$ when the
arguments (dynamic variables of identical particles) are permuted, as do
the conditions for hermiticity in the operators that represent physical
quantities in the language of the $R(q, p)$ matrix. For instance, it is readily
shown¹ that (5) leads to the integral relation

$$R^*(q, p) = \int R(q + \xi, p + \eta)\,\exp\left(\frac{i\xi\eta}{\hbar}\right)\frac{d\xi\,d\eta}{2\pi\hbar}, \tag{19}$$

while conditions (7) and (7') (for symmetry in the interchange of identical
particles) become

$$P_q R(q, p) = \pm R(q, p)\,\exp\left(-\frac{i}{\hbar}(p_i - p_k)(q_i - q_k)\right), \tag{20}$$

$$P_p R(q, p) = \pm R(q, p)\,\exp\left(-\frac{i}{\hbar}(p_i - p_k)(q_i - q_k)\right), \tag{20'}$$

$$P_{pq} R(q, p) = R(q, p), \qquad P_{pq} = P_p P_q, \tag{20''}$$

in which $q_i$ and $q_k$ are the coordinates, and $p_i$ and $p_k$ are the momenta
of the interchanged particles $i$ and $k$. $P_q$ is the coordinate interchange
operator and $P_p$ the momentum interchange operator.

All these relationships are readily satisfied if $R(q, p)$ is considered as a
bilinear form from the wave function given in the $q$, $p$ representation.

The practical value of the density matrix $R(q, p)$ is that it may be
expanded as a power series in Planck's constant if the quantum ensemble
differs little from a classical one:

$$R = \sum_n \hbar^n R_n. \tag{21}$$

Condition (19) may also be put as a power series in $\hbar$, for which
purpose we expand $R(q + \xi, p + \eta)$ in powers of $\xi$ and $\eta$, then using the
equation

$$I_{nm} = \iint \xi^n \eta^m \exp\left(\frac{i\xi\eta}{\hbar}\right)\frac{d\xi\,d\eta}{2\pi\hbar} = (i\hbar)^n\,m!\,\delta_{nm}. \tag{22}$$

This gives us from (19) that

$$R^*(q, p) = \sum_{n=0}^{\infty} \frac{(i\hbar)^n}{n!}\,\frac{\partial^{2n} R(q, p)}{\partial q^n\,\partial p^n} \tag{23}$$

and, substituting from (21), we get

$$R_0^* = R_0, \qquad R_1^* = R_1 + \frac{i}{1!}\frac{\partial^2 R_0}{\partial q\,\partial p}, \qquad R_2^* = R_2 + \frac{i}{1!}\frac{\partial^2 R_1}{\partial q\,\partial p} + \frac{i^2}{2!}\frac{\partial^4 R_0}{\partial q^2\,\partial p^2}. \tag{24}$$

These relationships show that matrix $R(q, p)$ can be real only in the
limit $\hbar \to 0$, i.e. for a classical ensemble; it is obliged to have a non-zero
imaginary part for $\hbar \ne 0$, so the matrix $R(q, p)$ is not the probability
density in phase space $R(q, p)$, as we would expect from the uncertainty
principle, which forbids an ensemble with exactly known coordinates $q$
and momenta $p$.
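Both properties of the matrix $R$, real marginal probabilities as in (15) and (16) but complex individual elements, can be seen in a finite-dimensional analogue. The sketch below is an assumption-laden illustration rather than the construction used in the text: the continuous variables are replaced by an `n`-point grid, the transformation to the momentum representation by the normalized discrete Fourier sum, and the normalization $1/\sqrt{n}$ is chosen so that plain sums over one index give the probabilities. The names `momentum_amplitudes` and `mixed_density` are hypothetical.

```python
import cmath
import math


def momentum_amplitudes(psi):
    """phi(p) = sum_q psi(q) exp(-2*pi*i*p*q/n) / sqrt(n)."""
    n = len(psi)
    return [sum(psi[q] * cmath.exp(-2j * math.pi * p * q / n) for q in range(n))
            / math.sqrt(n) for p in range(n)]


def mixed_density(psi):
    """Discrete analogue of (13) for a pure ensemble:
    R(q, p) = psi(q) * conj(phi(p)) * exp(-2*pi*i*p*q/n) / sqrt(n)."""
    n = len(psi)
    phi = momentum_amplitudes(psi)
    return [[psi[q] * phi[p].conjugate()
             * cmath.exp(-2j * math.pi * p * q / n) / math.sqrt(n)
             for p in range(n)] for q in range(n)]


psi = [0.5 + 0j, 0.5j, -0.5 + 0j, 0.5 + 0j]   # normalized illustrative state
phi = momentum_amplitudes(psi)
R = mixed_density(psi)
n = len(psi)

sum_over_p = [sum(R[q][p] for p in range(n)) for q in range(n)]   # cf. (15)
sum_over_q = [sum(R[q][p] for q in range(n)) for p in range(n)]   # cf. (16)
largest_imaginary_part = max(abs(R[q][p].imag) for q in range(n) for p in range(n))

print([abs(z) ** 2 for z in psi], sum_over_p)
print([abs(z) ** 2 for z in phi], sum_over_q)
print(largest_imaginary_part)
```

The sums over one index reproduce $|\psi(q)|^2$ and $|\phi(p)|^2$, yet the elements $R(q, p)$ themselves are complex, so they cannot be read as joint probabilities of $q$ and $p$.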

A special position arises for an ensemble containing identical particles.
The symmetry conditions (20) and (20') contain Planck's constant in a
form such that a singularity is bound to arise for $\hbar \to 0$; this makes
it impossible to expand $R$ as a power series in $\hbar$. This difficulty is
avoided²,³ if the matrix $R$ is replaced by a matrix averaged over the
phase volume:

$$\bar{R} = \frac{1}{\Omega}\int R(q, p)\,dq\,dp, \tag{25}$$

in which $\Omega$ is the volume of the region used in the average:

$$P - \Delta < p < P + \Delta, \qquad Q - \delta < q < Q + \delta, \qquad \Delta\delta = \Omega. \tag{25'}$$

The significance of this average is as follows. Relations (20) and (20')
imply that $R$ in Fermi statistics becomes zero on all hyper-planes in phase
space for which $p_i = p_k$ or $q_i = q_k$ (neglecting complications which arise
from the spin), while for Bose-Einstein statistics the matrix has turning
points with respect to the variable $p_i - p_k$ or $q_i - q_k$ on the surfaces
$p_i = p_k$, $q_i = q_k$ (matrix $\rho$, which may be considered as the amplitude of
matrix $R$, is seen from (13) to have turning points on the surfaces $p_i = p_k$ and
$q_i = q_k$). A region of the zero value or turning point extends over roughly
the wavelength $\lambda \sim \hbar/p$ of the particle. These regions do not play an
important part if there is only a small probability of finding two particles
in a distance comparable with the wavelength; this probability is $n\lambda^3$ if
there are $n$ particles per unit volume, and for $n\lambda^3 \ll 1$ we can choose a
volume $\Omega$ such that there is only a small contribution from the anomalous
regions (where $q_i \approx q_k$ and $p_i \approx p_k$) in the whole of phase space.

Condition $n\lambda^3 \ll 1$ may be put as $p^3/n \gg \hbar^3$, which means that the phase
volume per particle must be large relative to $\hbar^3$; if this is so, we can
choose a volume $\Omega$ for averaging $R$, such that $p^3/n \gg \Omega \gg \hbar^3$. The
$\bar{R}$ of (25) then has²,³ no important singularities with respect to $\hbar$.

CHAPTER VI


CAUSALITY IN QUANTUM MECHANICS 


Quantum mechanics is essentially a statistical theory, so that the history
of an individual particle may be traced only in very general terms (see
Chapter XV). We have seen that even in classical mechanics the input
data must be revised from time to time to offset the cumulative effects
arising from errors in the initial values. This feature becomes extremely
important in quantum mechanics: a particle may be localized in space
only approximately, and the localization steadily deteriorates as time
passes, the more rapidly the more precise the original localization was;
this is a direct consequence of the uncertainty principle.

This is in no way an attempt to belittle the value of quantum mechanics. 
All the more so, the statistical character of quantum mechanics should 
not be understood (as it is sometimes done) as indicating incompleteness 
or the need to seek a completely deterministic theory, for there may be 
no such theory; of course, we should not interfere with anyone who is 
attempting the apparently impossible. 

From what has been shown above regarding the illusion of determinism 
in the classical theory, where at least it had a certain basis in that the 
initial data could be discarded relatively slowly, it is natural to conclude 
that this illusion becomes for the microworld simply dangerous self- 
deception. However, this does not mean that causality has no place in 
statistical theories. 

Causality is a definite form of order in events in space and time; this 
ordering imposes restrictions even on the most chaotic events, and it 
makes itself felt in two ways in statistical theories. Firstly, the statistical 
laws themselves are fully ordered, and the quantities that characterize an 
ensemble are themselves strictly determined. Secondly, the individual 
elementary events are also so ordered that one may influence another 
only if their relative location in space and time allows this without 
violating causality (i.e. the rule ordering the events). 

In the relativistic theory, the events must be linked by a light signal
or by some other signal propagating with a velocity lower than that of
light. In the non-relativistic theory (the one envisaged here) all the
velocities are so much slower than that of light that the latter may be
taken as infinitely high, the theory being constructed formally in a way
such as to allow signals that produce interaction to travel with infinitely
high speeds. Figure 5 shows the "spheres of influence" in the relativistic
and non-relativistic cases. In the latter, each point on the plane $t = 0$
influences the points on an infinitely close plane $t + dt$, because the light
cone opens into a plane for $c \to \infty$. Determinism in the classical case then
means simply that the state of the system at a preceding instant
completely determines the state at a subsequent instant.

Fig. 5. Region within which an event at point $P$ can influence events at $t > 0$:
a) relativistic theory, b) classical theory.

The state of a system means in quantum theory that the system belongs
to some quantum ensemble, which is characterized by a wave function
in the pure case, or by a density matrix in the general case. Consider
first the pure case. Since the wave function $\Psi$ as a time function provides
an exhaustive description of the state of the ensemble, the above
formulation of the causality concept implies that the subsequent state of the
ensemble is also determined by the wave function. This $\Psi$ at time $t$ has
an increment $d\Psi$ in time $dt$, which must itself be expressed in terms of $\Psi$.
Further, this wave function itself is essentially defined with an arbitrary
factor $N$, so $\Psi$ and $\Psi' = N\Psi$ represent the same state. This factor must
have a modulus of 1 for the normalized function (i.e. $N = \exp(i\alpha)$, in
which $\alpha$ is an arbitrary real number), so $d\Psi$ must be linearly related to $\Psi$.
We have

$$d\Psi = \mathcal{L}\,\Psi\,dt, \tag{1}$$

in which $\mathcal{L}$ is some linear operator. Let the wave function at time $t + dt$
be $\Psi' = \Psi + d\Psi$; then the previous relation may be put as

$$\Psi' = S(dt)\,\Psi,$$

in which $S(dt)$ denotes a matrix that transforms $\Psi$ into $\Psi'$. The
conservation of

$$\int |\Psi|^2\,dq$$

implies that this matrix is unitary, i.e. that

$$S^{+}(dt) = \tilde{S}^{*}(dt) = S^{-1}(dt).$$

We put $S = 1 + \mathcal{L}\,dt + \cdots$ to get

$$\mathcal{L} = -(i/\hbar)\,\mathcal{H},$$

in which the operator $\mathcal{H}$ is hermitian
($\mathcal{H}^{+} = \tilde{\mathcal{H}}^{*} = \mathcal{H}$), while $\hbar$ is introduced
as a factor from dimensional considerations and correspondence
with the classical theory. $\mathcal{H}$ is essentially the operator of displacement
in time and is Hamilton's operator. If $\mathcal{H}$ itself is not dependent on time,
it is simply the energy operator of the microsystem.

This shows that the equation for the variation of the wave function in
time must be put in the form

$$i\hbar\,\frac{\partial\Psi}{\partial t} = \mathcal{H}\Psi. \tag{2}$$

This is Schrödinger's equation, which expresses causality in quantum
theory; $\Psi$ describes the quantum ensemble completely, so we may say
that (2) describes the motion of the quantum ensemble, and does so in a
causal fashion, i.e. so that the state earlier in time determines the
subsequent state of the ensemble.
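The causal character of (2) can be made concrete with a two-level system, for which the wave function is just a pair of complex numbers. The sketch below is an illustration under stated assumptions and is not the author's procedure: units with $\hbar = 1$ are assumed, the Hamiltonian is an arbitrary hermitian 2x2 matrix, and the short-time matrix is taken in the Cayley form $S(dt) = (1 + i\mathcal{H}dt/2\hbar)^{-1}(1 - i\mathcal{H}dt/2\hbar)$, which agrees with $1 - (i/\hbar)\mathcal{H}dt$ to first order in $dt$ and is exactly unitary. The names `step_matrix` and `evolve` are hypothetical.

```python
import math

HBAR = 1.0  # assumption: units with hbar = 1


def step_matrix(H, dt):
    """Unitary 2x2 matrix S(dt) = (1 + i H dt / 2 hbar)^(-1) (1 - i H dt / 2 hbar)."""
    k = 1j * dt / (2.0 * HBAR)
    a = [[1 + k * H[0][0], k * H[0][1]], [k * H[1][0], 1 + k * H[1][1]]]
    b = [[1 - k * H[0][0], -k * H[0][1]], [-k * H[1][0], 1 - k * H[1][1]]]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    a_inv = [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]
    return [[sum(a_inv[i][m] * b[m][j] for m in range(2)) for j in range(2)]
            for i in range(2)]


def evolve(H, psi, t, steps=2000):
    """Apply S(t / steps) repeatedly; a negative t steps back into the past."""
    S = step_matrix(H, t / steps)
    for _ in range(steps):
        psi = [S[0][0] * psi[0] + S[0][1] * psi[1],
               S[1][0] * psi[0] + S[1][1] * psi[1]]
    return psi


H = [[0.0, 1.0], [1.0, 0.0]]          # illustrative hermitian Hamiltonian
psi_0 = [1.0 + 0j, 0j]
psi_t = evolve(H, psi_0, t=1.0)       # state at t = 1 from the state at t = 0
psi_back = evolve(H, psi_t, t=-1.0)   # and the earlier state recovered from it
norm = abs(psi_t[0]) ** 2 + abs(psi_t[1]) ** 2
print(psi_t, norm)
print(psi_back)
```

For this particular Hamiltonian the exact solution is $(\cos t, -i\sin t)$, which the stepping reproduces closely while keeping the norm equal to one.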

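The equation for the conjugate function $\Psi^*$ states that

$$-i\hbar\,\frac{\partial\Psi^*}{\partial t} = \mathcal{H}^*\Psi^*, \tag{2'}$$

and for $\mathcal{H}^* = \mathcal{H}$ we have

$$-i\hbar\,\frac{\partial\Psi^*}{\partial t} = \mathcal{H}\Psi^*. \tag{2''}$$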

The differential equation of (2) may be replaced by the corresponding
integral equation. Green's functions may be used for a system of
non-interacting particles. Essentially, $\mathcal{H}$ may usually be put as the sum of the
operator $T$ for the kinetic energy of the particles and $V$, the operator for
the interactions of the particles mutually and with external fields. $T$ has
a particularly simple form in a cartesian coordinate system:

$$T = -\sum_{s=1}^{N} \frac{\hbar^2}{2m_s}\left(\frac{\partial^2}{\partial x_s^2} + \frac{\partial^2}{\partial y_s^2} + \frac{\partial^2}{\partial z_s^2}\right), \tag{3}$$

in which $m_s$ are masses of particles ($s = 1, 2, \ldots, N$, with $N$ the number
of particles in the system), whose cartesian coordinates are $x_s$, $y_s$, $z_s$. The
Green's function $G$ by definition is the solution to a Schrödinger equation
whose right-hand side takes the form of an instantaneous source at point
$x = x'$ and time $t = t'$,

$$i\hbar\,\frac{\partial G}{\partial t} - T(x)\,G = \delta(t - t')\,\delta(x - x'), \tag{4}$$

being zero for $t < t'$. (Here by $x$ and $x'$ are meant all the coordinates of
the particles, so that $\delta(x - x')$ is a $\delta$-function in $3N$ dimensions.)

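This equation coincides with the diffusion equation in multi-dimensional
space, but with an imaginary diffusion coefficient and an
instantaneous particle source at $x = x'$, $t = t'$. The solution to the equation is

$$G(x - x', t - t') = \frac{1}{i\hbar}\prod_{s=1}^{N}\left(\frac{m_s}{2\pi i\hbar\,(t - t')}\right)^{3/2}\exp\left(\frac{i m_s (x_s - x'_s)^2}{2\hbar\,(t - t')}\right) \tag{5}$$

for $t' < t$ (here $x_s$ stands for the three cartesian coordinates of particle $s$) and

$$G(x - x', t - t') = 0 \tag{5'}$$

for $t' > t$.

We may put (2) in the form

$$i\hbar\,\frac{\partial\Psi(x', t')}{\partial t'} - T(x')\,\Psi(x', t') = V(x', t')\,\Psi(x', t'). \tag{2'''}$$

Multiplying (4) by $V(x', t')\,\Psi(x', t')$ and integrating with respect to
$x'$ and $t'$ we find that $\Psi(x, t)$ satisfies the integral equation

$$\Psi(x, t) = \Psi_0(x, t) + \int G(x - x', t - t')\,V(x', t')\,\Psi(x', t')\,dx'\,dt', \tag{6}$$

in which $\Psi_0(x, t)$ is the solution for $V = 0$.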
This equation shows that there will be no effect on the state of the
ensemble, i.e. on $\Psi(x, t)$, from a change $\delta\Psi(x', t')$ in the wave function
or an interaction $\delta V(x', t')$ occurring at point $x'$ at time $t'$ (an event) if
$t' > t$; in other words, the state at any instant $t$ is influenced only by
events occurring at $t' < t$.

Equation (2) not only gives us the state in the immediate future from
the present state, but also the state in the immediate past; we merely
have to take $dt < 0$. The "delayed" Green's function of (5) may thus be
considered together with an "advanced" one:

$$G(x - x', t - t') = -\frac{1}{i\hbar}\prod_{s=1}^{N}\left(\frac{m_s}{2\pi i\hbar\,(t - t')}\right)^{3/2}\exp\left(\frac{i m_s (x_s - x'_s)^2}{2\hbar\,(t - t')}\right) \tag{5''}$$

for $t' > t$ and

$$G(x - x', t - t') = 0 \tag{5'''}$$

for $t' < t$. This function enables us to determine the state in the past.
This possibility arises from the reversibility of quantum mechanics in
time.

A. EQUATIONS OF MOTION FOR A MIXED ENSEMBLE,
DENSITY MATRIX $\rho(q, q')$


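We differentiate Equation (V-3) with respect to time on the assumption
that the conditions determining the mixture remain unchanged, so that
$P_{\mathcal{M}_s}$ = constant; we obtain

$$\frac{\partial\rho(q, q')}{\partial t} = \sum_s P_{\mathcal{M}_s}\left\{\frac{\partial\Psi^*_{\mathcal{M}_s}(q')}{\partial t}\,\Psi_{\mathcal{M}_s}(q) + \Psi^*_{\mathcal{M}_s}(q')\,\frac{\partial\Psi_{\mathcal{M}_s}(q)}{\partial t}\right\} \tag{7}$$

and use (2) and (2') to get that

$$\frac{\partial\rho(q, q')}{\partial t} = \frac{1}{i\hbar}\sum_s P_{\mathcal{M}_s}\left\{-\mathcal{H}^*(q')\,\Psi^*_{\mathcal{M}_s}(q')\,\Psi_{\mathcal{M}_s}(q) + \Psi^*_{\mathcal{M}_s}(q')\,\mathcal{H}(q)\,\Psi_{\mathcal{M}_s}(q)\right\}, \tag{7'}$$

or, since

$$\mathcal{H}(q', q'') = \mathcal{H}^*(q'', q') = \mathcal{H}^{+}(q', q''),$$

we have

$$\frac{\partial\rho(q, q')}{\partial t} + [\mathcal{H}, \rho] = 0, \tag{8}$$

in which $[\mathcal{H}, \rho]$ is the quantum Poisson bracket:

$$[A, B] = \frac{i}{\hbar}(AB - BA). \tag{9}$$

Equation (8) may be put in the form

$$\frac{d\rho}{dt} = 0, \tag{10}$$

which expresses Liouville's theorem in quantum mechanics; (8) or (10)
expresses a causality essentially equivalent to that expressed by (2): the
first derivative of the density matrix $\rho$ is itself defined by that matrix, i.e.
the future state is determined by the previous state.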
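Equation (8) can be checked numerically on a two-level mixed ensemble. The sketch below is an illustration under assumptions, not a construction from the text: units with $\hbar = 1$, the Hamiltonian $\mathcal{H} = g\,\sigma_x$ with an arbitrary coupling `G`, and an arbitrary hermitian initial density matrix of unit trace are all made up for the example, and the bracket is taken as $[A, B] = (i/\hbar)(AB - BA)$. The density matrix is propagated as $\rho(t) = S\rho(0)S^{+}$ and its time derivative is estimated by a central difference.

```python
import math

HBAR = 1.0   # assumption: units with hbar = 1
G = 0.7      # illustrative coupling strength
H = [[0j, G + 0j], [G + 0j, 0j]]   # H = G * sigma_x


def matmul(A, B):
    return [[sum(A[i][m] * B[m][j] for m in range(2)) for j in range(2)]
            for i in range(2)]


def dagger(A):
    return [[A[j][i].conjugate() for j in range(2)] for i in range(2)]


def evolution_matrix(t):
    """Exact S(t) = exp(-i H t / hbar) for H = G * sigma_x."""
    c, s = math.cos(G * t / HBAR), math.sin(G * t / HBAR)
    return [[c + 0j, -1j * s], [-1j * s, c + 0j]]


def rho_at(rho0, t):
    S = evolution_matrix(t)
    return matmul(matmul(S, rho0), dagger(S))


def poisson_bracket(A, B):
    """Quantum Poisson bracket [A, B] = (i / hbar) (AB - BA)."""
    AB, BA = matmul(A, B), matmul(B, A)
    return [[(1j / HBAR) * (AB[i][j] - BA[i][j]) for j in range(2)]
            for i in range(2)]


def residual(rho0, t, h=1e-4):
    """Largest element of d(rho)/dt + [H, rho], derivative by central difference."""
    plus, minus = rho_at(rho0, t + h), rho_at(rho0, t - h)
    bracket = poisson_bracket(H, rho_at(rho0, t))
    return max(abs((plus[i][j] - minus[i][j]) / (2 * h) + bracket[i][j])
               for i in range(2) for j in range(2))


def purity(rho):
    sq = matmul(rho, rho)
    return (sq[0][0] + sq[1][1]).real


rho0 = [[0.52 + 0j, -0.36j], [0.36j, 0.48 + 0j]]   # hermitian, unit trace
rho1 = rho_at(rho0, 1.3)
print(residual(rho0, 1.3))
print(purity(rho0), purity(rho1))
```

The residual vanishes to within the finite-difference error, and the trace of $\rho^2$ stays fixed, which shows that the mixing proportions are not changed by the motion.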

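If from ρ(q, q′) we pass to the matrix R(q, p), we find (see references 1, 2, 3
of Chapter V) after simple manipulations the following equation for the
matrix R:

$$\frac{\partial R(q, p)}{\partial t} + [\mathcal{H}, R] = \frac{dR}{dt} = 0, \tag{11}$$

where

$$[\mathcal{H}, R] = \frac{1}{i\hbar} \int \frac{d\xi\, d\eta}{2\pi\hbar} \exp\left(\frac{i\xi\eta}{\hbar}\right) \left\{ H(q, p + \eta)\, R(q + \xi, p) - H(q + \xi, p)\, R(q, p + \eta) \right\}. \tag{11'}$$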


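We expand this integral as a power series in ħ, for which purpose we
use formula (V-22):

$$\frac{\partial R(q, p)}{\partial t} + \sum_{n=1}^{\infty} \frac{(i\hbar)^{n-1}}{n!} [H, R]_n = 0, \tag{12}$$

in which

$$[H, R]_n = \frac{\partial^n H}{\partial p^n} \frac{\partial^n R}{\partial q^n} - \frac{\partial^n H}{\partial q^n} \frac{\partial^n R}{\partial p^n} \tag{13}$$

is the classical Poisson bracket of order n. If ℋ equals T(p) + V(q), the
matrix elements of ℋ in the q, p-representation are simply equal to the
corresponding classical hamiltonian:

$$H(q, p) = \frac{p^2}{2m} + V(q). \tag{14}$$

Then (12) takes the form

$$\frac{\partial R}{\partial t} + \frac{p}{m} \frac{\partial R}{\partial q} - \frac{\partial V}{\partial q} \frac{\partial R}{\partial p} + \frac{i\hbar}{2m} \frac{\partial^2 R}{\partial q^2} - \sum_{n \geq 2} \frac{(i\hbar)^{n-1}}{n!} \frac{\partial^n V}{\partial q^n} \frac{\partial^n R}{\partial p^n} = 0. \tag{15}$$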

The terms of this equation that contain only first derivatives of R
coincide exactly with the classical equation for the density ρ(q, p) [see
(IV-4) or (IV-5)], while the subsequent terms may be considered as
quantum corrections, which are dependent on the higher derivatives of the
density R and of the potential V, which include derivatives with respect
to p as well as ones with respect to q. These equations may thus be
useful when the density function R is a reasonably smooth function in
the phase space ℜ(p, q).

The classical terms in (15) show that the motion in the classical
approximation is governed by the velocity p/m at the point in question
and by the force −∂V/∂q acting at that point. The subsequent (quantum)
terms indicate the steadily increasing importance of the environment in
(p, q) phase space: the ∂²R/∂q² term leads to diffusion of the density
R(q, p) (diffusion coefficient imaginary) while terms of the type
(∂ⁿV/∂qⁿ)(∂ⁿR/∂pⁿ) indicate that the motion of the ensemble in quantum
mechanics is governed not only by the form of the potential at the point in
question but also by that throughout the space accessible to the ensemble.


B. EXPLICIT FORM OF THE EQUATION FOR THE
DENSITY MATRIX ρ(q, q′)


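This is found in the coordinate representation by noting that the matrix
elements of ℋ take the following form in the q representation:

$$H(q, q') = \frac{\hbar^2}{2m} \frac{\partial^2}{\partial q\, \partial q'}\, \delta(q - q') + V(q')\, \delta(q - q'). \tag{16}$$

Substituting this expression into (8) and introducing the variables
Q = (q + q′)/2 and ξ = q − q′, we find

$$\frac{\partial \rho}{\partial t} = \frac{i\hbar}{m} \frac{\partial^2 \rho}{\partial Q\, \partial \xi} + \frac{1}{i\hbar} \left[ V\left(Q + \frac{\xi}{2}\right) - V\left(Q - \frac{\xi}{2}\right) \right] \rho. \tag{17}$$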


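This form of the equation of motion for the density matrix

$$\rho(q, q') = \rho(Q, \xi)$$

corresponds¹ in classical mechanics to the equations of motion for the
Fourier transform of the classical density ρ_c(Q, P) in phase space:

$$\rho_c(Q, \xi) = \int \rho_c(Q, P) \exp\left(\frac{iP\xi}{\hbar}\right) dP \tag{18}$$

(here ħ has so far been inserted purely formally). In fact, converting from
ρ_c(Q, P) to ρ_c(Q, ξ) and putting

$$H(Q, \xi) = \int H(Q, P) \exp\left(\frac{iP\xi}{\hbar}\right) dP, \tag{19}$$

we get

$$\frac{\partial \rho_c(Q, \xi)}{\partial t} = \frac{i}{2\pi\hbar^2} \int \left\{ (\xi - \xi')\, H(Q, \xi - \xi')\, \frac{\partial \rho_c(Q, \xi')}{\partial Q} - \xi'\, \frac{\partial H(Q, \xi - \xi')}{\partial Q}\, \rho_c(Q, \xi') \right\} d\xi', \tag{20}$$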
and in particular, when

$$H(Q, P) = \frac{P^2}{2m} + V(Q),$$

we get explicitly from (20) that

$$\frac{\partial \rho_c(Q, \xi)}{\partial t} = \frac{i\hbar}{m} \frac{\partial^2 \rho_c(Q, \xi)}{\partial Q\, \partial \xi} + \frac{1}{i\hbar} \frac{\partial V(Q)}{\partial Q}\, \xi\, \rho_c(Q, \xi). \tag{21}$$

We see on comparing this with the quantum equation (17) that the
latter becomes (21) if

$$V\left(Q + \frac{\xi}{2}\right) - V\left(Q - \frac{\xi}{2}\right) \approx \frac{\partial V(Q)}{\partial Q}\, \xi,$$

i.e. if the potential V(Q) is a sufficiently smooth function of the
coordinates Q.

This shows that classical mechanics can also accommodate the method
of describing the state of an ensemble via pairs of points q and q′ in
configuration space ℜ(q).


IS THE WAVE FUNCTION AVOIDABLE?


The basic equation of quantum mechanics, Schrödinger’s equation,
resembles the equations for the density matrix in that they can all be put
in a form very similar to the equations of classical statistical mechanics
or of the mechanics of a host of particles. This has given rise to a
temptation that has more than once led to consideration¹ of quantum
mechanics as a form of classical mechanics supplemented by some novel
quantum force. The present author himself once hoped that the striking
similarities between the equations for the R(q, p) matrix and the equations
of classical physics would allow the formulation of quantum mechanics as
the statistical mechanics of quantities not simultaneously measurable.
The basis of the theory would then be not the wave function but the
R(q, p) matrix, the analogue of the classical density ρ_c(q, p) in ℜ(q, p)
phase space. All such attempts employ, in one way or another, equations
that are (in wave-function language) non-linear with respect to the
wave function. This feature has proved a serious obstacle to such
attempts.

Let us consider first what has been called the “hydrodynamic” form
of Schrödinger’s equation. Here the theory is based on the probability
density ρ(x) = |ψ(x)|², with θ(x) = ħ arg ψ(x) as the phase of the wave
function (in units of action). Each of these has a classical interpretation:
ρ(x) may be considered as the mean density of the particles while ∇θ(x)/m
may be considered as the mean velocity of these at the point x, so that
J = ρ∇θ/m has the meaning of particle flux density. However, the equations
for these quantities are non-linear; if in Schrödinger’s equation

$$i\hbar \frac{\partial \psi}{\partial t} = -\frac{\hbar^2}{2m} \nabla^2 \psi + V\psi \tag{1}$$


(for simplicity considered for one particle) we insert a wave function of
the form

$$\psi = \rho^{1/2} \exp(i\theta/\hbar)$$

we obtain two equations for ρ and θ:

$$\frac{\partial \rho}{\partial t} + \operatorname{div}\left( \rho\, \frac{1}{m} \nabla\theta \right) = 0, \tag{2}$$

$$\frac{\partial \theta}{\partial t} + \frac{1}{2m} (\nabla\theta)^2 + V - \frac{\hbar^2}{4m} \left( \frac{\nabla^2 \rho}{\rho} - \frac{1}{2} \frac{(\nabla\rho)^2}{\rho^2} \right) = 0. \tag{2'}$$

Both may be considered as classical; the first as the equation of
continuity for the particle density ρ and for the flux J = ρ∇θ/m, and the
second as the equation of the action function θ. The second equation
coincides with the classical equation if the “quantum” term

$$-\frac{\hbar^2}{4m} \left( \frac{\nabla^2 \rho}{\rho} - \frac{1}{2} \frac{(\nabla\rho)^2}{\rho^2} \right)$$

is considered as an additional “quantum” potential dependent on ρ and
on derivatives of the latter.
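
As an illustration of the “quantum” potential, the following sketch evaluates the term −(ħ²/4m)(∇²ρ/ρ − ½(∇ρ)²/ρ²) in one dimension for a Gaussian density and compares it with the closed form −(ħ²/2m)(x²/4s⁴ − 1/2s²) obtained by differentiating ρ^{1/2} directly. The units (ħ = m = 1), the width s and the grid are assumptions of the sketch, not values from the text.

```python
import numpy as np

HBAR = 1.0  # assumed units
MASS = 1.0

def quantum_potential(rho, dx):
    """-(hbar^2/4m) * (rho''/rho - (1/2) (rho'/rho)^2) by finite differences."""
    d1 = np.gradient(rho, dx)
    d2 = np.gradient(d1, dx)
    return -(HBAR**2 / (4.0 * MASS)) * (d2 / rho - 0.5 * (d1 / rho) ** 2)

x = np.linspace(-3.0, 3.0, 2001)
dx = x[1] - x[0]
s = 1.0  # hypothetical width of the Gaussian density
rho = np.exp(-x**2 / (2.0 * s**2)) / np.sqrt(2.0 * np.pi * s**2)

q_numeric = quantum_potential(rho, dx)
q_exact = -(HBAR**2 / (2.0 * MASS)) * (x**2 / (4.0 * s**4) - 1.0 / (2.0 * s**2))

# The grid edges use one-sided differences, so compare on the interior only.
interior = slice(10, -10)
max_err = np.max(np.abs(q_numeric[interior] - q_exact[interior]))
```

The potential depends on the shape of ρ alone and not on its normalization, which is one way of seeing that it is a property of the ensemble and not an external force.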

At first sight this interpretation of the equations appears to be free
from conflict, while the equations themselves are equivalent to the initial
Schrödinger equation of (1). However, this is not so, because the principle
of superposition of states plays a fundamental part in quantum mechanics.
This principle indicates that, if there are two states of the quantum
ensemble represented by wave functions ψ_ℳ₁ and ψ_ℳ₂, we can produce
a macroscopic setting ℳ = ℳ₁ + ℳ₂ such that the new ensemble will be
represented by the wave function

$$\psi_{\mathcal{M}} = C_1 \psi_{\mathcal{M}_1} + C_2 \psi_{\mathcal{M}_2}, \tag{3}$$

in which C₁ and C₂ are arbitrary numbers subject only to the
normalization condition

$$|C_1|^2 + |C_2|^2 = 1.$$


This simple but important principle cannot be expressed in the language
of ρ and θ without resort to the wave function again. An even greater
inconvenience arises in formulating the symmetry conditions for a system
of identical particles. These conditions for the wave function state that

$$P_{ik}\, \psi = \pm \psi, \tag{4}$$

in which P_ik is the interchange operator for particles i and k, while the
sign ± depends on the statistics (Fermi or Bose-Einstein). This simple
condition is formulated in a clear fashion in the linear apparatus of
quantum mechanics, but it is reflected only in a very obscure fashion in
the language of ρ and θ. For instance, suppose we have two identical
particles; then we may consider (1) and (2) as equations of functions
describing the relative motion of these particles, while the motion of the
centre of gravity is distinguished in the usual way by separating the
variables. Symmetry condition (4) is then transferred to the function for
the relative motion (the function describing the motion of the centre of
gravity is automatically symmetrical with respect to exchange); it means
that ψ(x) must be even or odd when x is replaced by −x, and (4) thus
implies that

$$\rho(-x) = \rho(x), \qquad \theta(-x) = \theta(x) + \pi\hbar S, \tag{5}$$

in which S = 0, 2, 4, ... or S = 1, 3, 5, ....

The condition on ρ(x) is mathematically of no inconvenience, but the
second condition is extremely troublesome; if we seek ways of meeting
this requirement, we find that the simplest way of satisfying (5) is to
return to the wave function and the condition that this be even or odd.
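
The obstacle that the superposition principle (3) presents to the ρ-θ language may also be seen numerically. The sketch below is an illustration under assumed data (two hypothetical Gaussian packets with equal coefficients, none of it from the text): the density of the superposition is not the weighted sum of the component densities, and it changes when only the relative phase of C₁ and C₂ is changed.

```python
import numpy as np

x = np.linspace(-5.0, 5.0, 1001)

# Two hypothetical component states (not normalized; only the shapes matter here).
psi_1 = np.exp(-(x - 1.0) ** 2) * np.exp(1j * 2.0 * x)
psi_2 = np.exp(-(x + 1.0) ** 2) * np.exp(-1j * 2.0 * x)
c = 1.0 / np.sqrt(2.0)  # |C1|^2 + |C2|^2 = 1

def density(relative_phase):
    """Density of C1 psi_1 + C2 psi_2 with C2 = c * exp(i * relative_phase)."""
    psi = c * psi_1 + c * np.exp(1j * relative_phase) * psi_2
    return np.abs(psi) ** 2

# What one would get by combining the component densities alone.
weighted_sum = c**2 * np.abs(psi_1) ** 2 + c**2 * np.abs(psi_2) ** 2

diff_from_weighted_sum = np.max(np.abs(density(0.0) - weighted_sum))
diff_between_phases = np.max(np.abs(density(0.0) - density(np.pi)))
```

Both differences are of the order of the interference term, so the pair (ρ₁, ρ₂) does not determine the density of the superposed ensemble; the phases of the wave functions are needed as well.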

Another form for the quantum-mechanical equations is close to that
of classical statistical mechanics and is based on the use of the R(q, p)
density matrix. Formulae (V-15), (V-16), and (VI-12) show that there is
a close analogy between the equations of classical statistical physics and
the equations for the quantum density R(q, p). However, we have seen
(Chapter V) that matrix R(q, p) differs from the classical density in being
obliged to have a non-zero imaginary part, so in the simple sense of the
word there cannot be a positive-definite probability or density in
ℜ(q, p) phase space. The complex character of R(q, p) is an expression
of the principle of complementarity, but it might have been hoped that
R(q, p) could be considered as an extension of the probability density
concept to the case of variables q and p not simultaneously measurable,
this matrix being used as the basis of the theory instead of the wave
function. However, the matrix R(q, p) is non-linear with respect to the
wave function, so all the difficulties noted above for the ρ-θ description
also occur here. In particular, the difficulty over (3) (principle of
superposition) that arose with ρ and θ persists when R is used.

In addition, it is extremely difficult to meet the hermiticity conditions
(V-19) and the symmetry conditions (V-20) for the case of identical
particles unless we resort to the wave-function concept; it is probable
that they may be satisfied in general form only by considering R(q, p)
as bilinear in ψ*(p) and ψ(q).

These difficulties are common to all attempts to formulate quantum
mechanics in the language of quantities non-linear with respect to the
wave function; throw the latter out of the door and it will come back
through the window. This persistence of the wave function is an
expression of the fact that the language of the linear theory is that of the
essential nature of quantum mechanics; translation into any other tongue
will always cause the poetry of this linearity to sound weaker than in the
original.


IS THE WAVE FUNCTION MEASURABLE?


We have defended the use of the wave function, but we now must consider
what it represents and whether it can be measured or established by
experiment, as can, for example, a particle density or the temperature of a
gas. Frequently the answer is given in the negative, as the wave function is
essentially undefined as regards a phase constant; the two wave functions
ψ and ψ′ related by

$$\psi' = \psi\, e^{i\alpha} \tag{1}$$

(in which α is a real number) represent the same quantum ensemble.
Hence in speaking of the wave function as measurable or otherwise, we
will from the start not demand the impossible (or rather meaningless)
and will understand by measurement of the wave function merely
measurement apart from the phase constant α.

We may avoid a complicated argument by considering the function in
explicit form in terms of the coordinates x of one particle (by x we might
also understand the relative coordinates of two particles). The wave
function is complex in the general case, and we put it in the form

$$\psi(x) = |\psi(x)|\, e^{i\theta(x)}, \tag{2}$$

in which |ψ(x)| is the real amplitude and θ(x) is the phase. There is no
fundamental objection to the experimental determination of ρ(x), the
mean particle density, so this amplitude may be determined, since by
definition

$$\rho(x) = |\psi(x)|^2, \qquad |\psi(x)| = +\sqrt{\rho(x)}. \tag{3}$$

Further, the mean current density may also be measured:

$$J = \frac{\hbar}{m}\, \rho\, \nabla\theta. \tag{4}$$
Then θ(x) is given by

$$\theta(x) = \frac{m}{\hbar} \int_{x_0}^{x} \frac{J}{\rho}\, ds, \tag{5}$$

in which ds is a length element and x₀ is some arbitrary point, the
arbitrariness arising from the arbitrary choice of the phase in (1). For clarity
we suppose that we are considering the measurement of ρ(x) and θ(x)
in an atom or ion with one electron in an s-state or a p-state. In the first
case the wave function has spherical symmetry and currents are absent,
so

$$\psi(x) = \sqrt{\rho(x)} \tag{6}$$

and measurement of ρ(x) gives us directly the values of the wave function.
In the second case the wave function assumes the following form in
polar coordinates r, ϑ, φ:

$$\psi(r, \vartheta, \varphi) = A(r, \vartheta)\, e^{im\varphi}, \qquad m = 0, \pm 1, \tag{7}$$

the double sign corresponding to the two orientations of the atom in a
magnetic field. We assume that this field is used to produce one of the
possible orientations; then we measure ρ(x) to find the modulus of the
function:

$$A(r, \vartheta) = \sqrt{\rho(r, \vartheta)}, \tag{8}$$

while by measuring the magnetic moment of the atom, which is

$$\mathfrak{M} = \frac{e\hbar}{2\mu c}\, m \tag{9}$$

(μ being the mass of an electron), we find also the phase θ = mφ of the
wave function. These simple examples show that the wave function can
be measured.
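
The recovery of the phase from the density and the current, Equations (4) and (5), may be sketched numerically in one dimension. The wave function below is a hypothetical example in assumed units ħ = m = 1; its density and current play the part of the “measured” quantities, and the phase is then rebuilt by integrating (m/ħ)J/ρ from the left end x₀ of the grid.

```python
import numpy as np

HBAR, MASS = 1.0, 1.0  # assumed units

x = np.linspace(-2.0, 2.0, 4001)
dx = x[1] - x[0]

# Hypothetical state: Gaussian density with a smoothly varying phase.
theta_true = 1.5 * x + 0.3 * np.sin(2.0 * x)
rho = np.exp(-x**2)
psi = np.sqrt(rho) * np.exp(1j * theta_true)

# "Measured" current density J = (hbar/m) Im(psi* dpsi/dx) = (hbar/m) rho dtheta/dx.
current = (HBAR / MASS) * np.imag(np.conj(psi) * np.gradient(psi, dx))

# Equation (5): theta(x) - theta(x0) = (m/hbar) * integral of J/rho (trapezoidal rule).
integrand = (MASS / HBAR) * current / rho
steps = 0.5 * (integrand[1:] + integrand[:-1]) * dx
theta_recovered = np.concatenate(([0.0], np.cumsum(steps)))

max_err = np.max(np.abs(theta_recovered - (theta_true - theta_true[0])))
```

The recovered phase agrees with the original one up to the additive constant θ(x₀), which is exactly the arbitrariness allowed by (1).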

The above examples concern measurements on bound states. Consider
now the case of particle scattering, where the wave function takes the
form

$$\psi(x) = e^{ikx} + u(x), \tag{10}$$

in which exp(ikx) is the primary incident wave (whose wave number is
k = 1/ƛ) and u(x) is the scattered wave, the latter taking the following
form at great distances from the scattering point:

$$u(x) = \frac{e^{ikr}}{r}\, A(k, \vartheta), \tag{11}$$

$$A(k, \vartheta) = \frac{1}{2ik} \sum_{l=0}^{\infty} (2l + 1) \left( e^{2i\eta_l} - 1 \right) P_l(\cos\vartheta), \tag{12}$$

in which η_l(k) are the phases of the scattered waves and P_l(cos ϑ) are
Legendre polynomials. We assume that we measure the particle density
(although it would be nearer to the practical case to speak of measuring
the particle flux density), which is

$$|\psi|^2 = |\psi_0 + u|^2$$

and which contains the interference term

$$\psi_0 u^* + \psi_0^* u,$$

which complicates the argument. However, the difficulty is readily
avoided by realizing that description of the incident wave as a plane
wave represents an abstraction, since strictly speaking a beam bounded
in space (a wave packet) should be considered. Then we observe particles
scattered far away from the primary beam, thus avoiding interference
between the primary and scattered waves (note that these waves are
coherent).

This is a realistic representation of an experiment for measuring |u|²
in the usual way; experiment thus gives us

$$|u|^2 = \frac{1}{r^2} |A(k, \vartheta)|^2. \tag{13}$$

Hence we find the modulus of the scattered-wave amplitude A(k, ϑ), but
not the phase; in general,

$$A(k, \vartheta) = |A(k, \vartheta)|\, e^{i\alpha(k, \vartheta)},$$

and the phase α(k, ϑ) is lost from the measured quantity; an additional
measurement of the flux density of the scattered particles yields nothing
new, since the formula for the flux density

$$J = \frac{i\hbar}{2m} (u \nabla u^* - u^* \nabla u) \tag{14}$$

implies that

$$J = \frac{\hbar k}{m} |u|^2,$$

so J is simply proportional to |u|². The phase uncertainty gives rise to
the serious and troublesome problem of recovering the phase from the
experimental data, by which is meant the part of the phase α(k, ϑ) that
is dependent on the angle ϑ and momentum k. The problem may be
illustrated by reference to two particular cases. First we assume that only
the first phase differs from zero (S-scattering); in this very simple case
we get from (12) that


$$|A(k, \vartheta)|^2 = \frac{1}{k^2} \sin^2 \eta_0, \tag{15}$$

$$\sin \eta_0 = \pm k\, |A(k, \vartheta)|, \tag{15'}$$


and so the phase of the S-scattering is determined apart from sign. Here 
there thus arises an ambiguity, which is not entirely irrelevant for it 
conceals, for example, the question whether the forces between the 
particles are ones of repulsion or attraction. 

This phase η₀ is related¹ to the particle interaction potential V(r)
(for weak interaction) as follows:

$$\eta_0 = -\frac{2mk}{\hbar^2} \int_0^\infty V(r)\, \frac{\pi}{2kr}\, J_{1/2}^2(kr)\, r^2\, dr,$$

in which J_{1/2}(z) is a Bessel function, so that the sign of V(r) determines
that of η₀.

Consider now the more complicated case where two phases are
different from zero, e.g. the S-phase and the P-phase. Here

$$|A(k, \vartheta)|^2 = \frac{1}{k^2} \left\{ \sin^2 \eta_0 + 6 \sin\eta_0 \sin\eta_1 \cos(\eta_0 - \eta_1) \cos\vartheta + 9 \sin^2 \eta_1 \cos^2 \vartheta \right\}. \tag{16}$$

The coefficients of cos⁰ ϑ, cos¹ ϑ, and cos² ϑ may be determined to give
us the phases η₀ and η₁ apart from sign.
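
The sign ambiguity may be made concrete with a short sketch. It assumes the standard normalization 1/(2ik) of the partial-wave series truncated at l = 0, 1, and the numerical values of k and of the phases are purely illustrative. The code checks that |A|² is a quadratic polynomial in cos ϑ with coefficients sin²η₀, 6 sin η₀ sin η₁ cos(η₀ − η₁) and 9 sin²η₁ (each divided by k²), and that these coefficients are unchanged when both phases reverse sign.

```python
import numpy as np

def amplitude(k, eta, cos_theta):
    """Partial-wave amplitude with S and P waves only (l = 0, 1)."""
    legendre = [np.ones_like(cos_theta), cos_theta]  # P_0 and P_1
    series = sum((2 * l + 1) * (np.exp(2j * eta[l]) - 1.0) * legendre[l] for l in range(2))
    return series / (2j * k)

def angular_coefficients(k, eta):
    """Coefficients of cos^0, cos^1, cos^2 in |A|^2 for S and P waves."""
    s0, s1 = np.sin(eta[0]), np.sin(eta[1])
    return np.array([s0**2, 6.0 * s0 * s1 * np.cos(eta[0] - eta[1]), 9.0 * s1**2]) / k**2

k = 1.3                 # hypothetical wave number
eta = (0.4, -0.25)      # hypothetical S and P phases
cos_theta = np.linspace(-1.0, 1.0, 11)

measured = np.abs(amplitude(k, eta, cos_theta)) ** 2
a0, a1, a2 = angular_coefficients(k, eta)
polynomial = a0 + a1 * cos_theta + a2 * cos_theta**2

# Reversing the sign of both phases leaves every measurable coefficient unchanged.
flipped = angular_coefficients(k, (-eta[0], -eta[1]))
```

Three measured coefficients thus fix |sin η₀|, |sin η₁| and cos(η₀ − η₁), but the common sign of the two phases remains open, which is the ambiguity referred to above.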

These simple examples show that the problem of recovering the phase
of the wave function (even the asymptotic value) is far from simple,
although the difficulties are not fundamental. This is illustrated by the
observation that for (say) proton-proton scattering we may deduce the
sign of η₀ from the interference between the nuclear scattering and the
scattering arising from the Coulomb field of the nucleus. The differential
cross-section for elastic scattering of protons into a solid angle dΩ (after
averaging with respect to the spins, see ref. 2) is as follows when the
specifically nuclear interaction is included:

$$d\sigma(\vartheta) = \frac{e^4}{E^2} \cos\vartheta \left[ \frac{1}{\sin^4\vartheta} + \frac{1}{\cos^4\vartheta} - \frac{1}{\sin^2\vartheta \cos^2\vartheta} - \frac{\hbar v}{e^2} \frac{\sin 2\eta_0}{\sin^2\vartheta \cos^2\vartheta} + \left( \frac{2\hbar v}{e^2} \right)^2 \sin^2 \eta_0 \right] d\Omega, \tag{17}$$

in which E is proton energy, v is the relative velocity of the protons, e
is the electronic charge, and η₀ is the phase of the nuclear interaction.
We may assume that this cross-section is known from experiment, and
consequently that we know the correction to the pure Coulomb scattering
arising from the nuclear interaction in the S-state. This correction is

$$f(E, \vartheta)\, d\Omega = \frac{e^4}{E^2} \cos\vartheta \left[ -\frac{\hbar v}{e^2} \frac{\sin 2\eta_0}{\sin^2\vartheta \cos^2\vartheta} + \left( \frac{2\hbar v}{e^2} \right)^2 \sin^2 \eta_0 \right] d\Omega. \tag{18}$$

We differentiate f(E, ϑ) with respect to ϑ for fixed energy to get

$$\frac{\partial f(E, \vartheta)}{\partial \vartheta} = -\frac{e^4}{E^2} \left\{ \frac{\hbar v}{e^2} \sin 2\eta_0\, \frac{\partial}{\partial \vartheta} \left[ \frac{1}{\sin^2\vartheta \cos\vartheta} \right] + \left( \frac{2\hbar v}{e^2} \right)^2 \sin^2 \eta_0\, \sin\vartheta \right\}, \tag{18'}$$

which defines η₀. Allowance must be made for higher phases at higher
proton energies, which makes the problem more complicated.

These simple examples show that the wave function can be measured. 

Let us now consider in more detail the measurements actually needed,
first in relation to particle scattering, which for simplicity we assume to
be elastic. The directly measured quantity is the differential cross-section,
which is related to the scattering amplitude by

$$\sigma(\vartheta, k) = |A(\vartheta, k)|^2. \tag{19}$$

We have just discussed the determination of the amplitude itself, so
we shall now concentrate attention on the determination of the
cross-section σ. From its definition we have

$$dN = \sigma\, J\, \frac{dS}{r^2} = \sigma\, J\, d\Omega, \tag{20}$$

in which J is the current density in the incident wave and dN is the
number of particles scattered in unit time into an element of solid angle
dΩ. This implies directly that it is insufficient to observe a single act of
scattering in order to determine σ, for this tells us exactly nothing about
σ(ϑ, k) as a function of the scattering angle ϑ. This single act of scattering
may be typical (probable) or atypical (improbable). We must observe
many acts of scattering at various angles in order to obtain σ(ϑ, k) as a
function of ϑ from experiment, and the numbers must be such as to rule
out errors arising from random fluctuations. This is essentially a trivial
feature known to any experimentalist, and it needs to be mentioned only
because it is stated in many textbooks on quantum mechanics that the
wave function is a characteristic of the state of a single particle. If this
were so, it would be of interest to perform such a measurement on a single
particle (say an electron) which would allow us to determine its own
individual wave function. No such measurement is possible.

This contradiction is resolved if we bear in mind the idea that the
wave function is a characteristic arising from the existence of a quantum
ensemble for the particles, i.e. if we consider a particle μ together with
its macroscopic setting ℳ, the latter governing the conditions of motion
of the former.
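
The need for many acts of scattering can be illustrated by a small simulation. The angular distribution used below, w(cos ϑ) = (3/8)(1 + cos ϑ)², is a hypothetical cross-section chosen only because it can be sampled in closed form; the seed and the event counts are likewise assumptions of the sketch. One event tells us nothing about the distribution, whereas a large number of events reproduces the fraction scattered into the forward hemisphere, whose exact value is 7/8.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so that the sketch is reproducible

def sample_cos_theta(n):
    """Draw n values of cos(theta) from the density (3/8) * (1 + c)**2 on [-1, 1]."""
    u = rng.random(n)
    # The cumulative distribution is (1 + c)**3 / 8, inverted here directly.
    return 2.0 * np.cbrt(u) - 1.0

def forward_fraction(events):
    """Fraction of events scattered into the forward hemisphere, cos(theta) > 0."""
    return np.mean(events > 0.0)

exact = 1.0 - 1.0 / 8.0
single = forward_fraction(sample_cos_theta(1))        # one act of scattering: 0 or 1
many = forward_fraction(sample_cos_theta(100_000))    # an ensemble of acts
```

With 100 000 events the statistical error of the estimate is of the order of 10⁻³; with one event the “estimate” can only be 0 or 1.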

Now we may turn to measurement of the wave function for a bound 
state; here we encounter some features that deserve attention. 

In the above experiment on the asymptotic behaviour of the wave
function in elastic scattering, the number of scattered particles was
measured directly by some macroscopic device, e.g. a Geiger counter
(Figure 6), which was entirely feasible in that case because it was a
question of observing free scattered particles far from the scattering
centre, i.e. far from the site of all the microscopic processes. This type
of measurement can be called direct. The situation is fundamentally
different for a bound state: here the wave function is localized in a
microscopic volume, within which no macroscopic measuring device can
be inserted. Hence any measurement of the wave function in this case
must be based on some means of indirect measurement² via a directly
measurable interaction of the system with some other system. The last
may be a free particle scattered by the system.

Fig. 6. The primary wave k is scattered by a microsystem A, the scattered wave k′
being shown by wavy lines. C is a counter for particles scattered at an angle ϑ.

Hence the indirect measurement of the wave function for a bound 
state (for which no direct measurement exists) reduces to our previous 
experiment on the scattering of free particles. It should hardly be neces- 
sary here to stress again that no single measurement can give us any 
information on the wave function of the bound state. We must find the 
cross-section of the scattered particle as the first step in our indirect 
measurement, since this is used as our probe to study the innards of the 
atomic system. Here we have to ensure (in experimentalists’ terminology)
good statistics, i.e., we have to make many measurements on the
scattering of our probe particle by the system.

We deduce the differential cross-section for the scattering from direct
measurements of the number of scattered probe particles; the wave
function of the system must be deduced via recovery of the scattering
amplitude A(ϑ, k).

The above arguments have shown that this problem often does not
allow an unambiguous solution, but if we assume that we have been
able to determine the scattering amplitude for the probe particle then all
the information on the atomic system is contained in the infinite set of
phases η₀(k), η₁(k), ..., η_l(k), ..., which are functions of the energy of the
relative motion of the probe particle and the atomic system. Even in this
very optimistic case we encounter the complicated mathematical task of
deriving the structure of the object from its optical image, which is now
considered as specified via the amplitude A(ϑ, k). Of course, this is
provided by experiment as the angular intensity distribution for the
scattered particles, or (which is the same) as the differential cross-section;
it should be stressed that this mathematical problem, no matter how
complicated, is soluble, at least in certain cases, which of itself shows
that in principle we can pursue the path of indirect measurement.

A bound state is a state of an isolated microsystem, so it will be more
convenient in what follows to speak of studying the structure of an 
atomic microsystem (we have already used this terminology), since in the 
general case we examine not only the wave function of the system but also 
the mode of interaction with the probe particle. The next chapter deals 
with reconstruction of structure from the image of the microsystem.


CHAPTER IX


DEDUCTION OF THE STRUCTURE OF A 
MICRO-OBJECT FROM PARTICLE SCATTERING 


The basis of our discussion of this topic is Schrödinger’s equation for the
probe particle. Let m be the reduced mass of this particle, which is found
from 1/m = 1/μ + 1/ℳ, in which μ is the actual mass of the scattered
particle and ℳ is the mass of the scatterer, and let x be the coordinate
of the scattered particle relative to the centre of gravity. Then Schrödinger’s
equation for the relative motion ψ(x) is as follows, any motion of the
centre of gravity being of no interest:

$$-\frac{\hbar^2}{2m} \nabla^2 \psi(x) + V(x)\, \psi(x) = E\, \psi(x), \tag{1}$$

in which V(x) is a function describing the interaction between the
microsystem and the probe particle whose energy is E. The wave function must
take the form

$$\psi(x) = e^{ikz} + u(x), \tag{2}$$

in which exp(ikz) is the primary plane wave and u(x) is the scattered
wave, the latter for large distances r = |x| → ∞ being of the form

$$u(x)_{r \to \infty} = \frac{e^{ikr}}{r}\, A(k, \vartheta), \tag{3}$$

where A(k, ϑ) is the amplitude, which from (VIII-19) determines the
differential scattering cross-section, i.e. the experimentally observed
quantity.

By determination or deduction of the structure we shall understand
determination of V(x) from data on the scattering of the probe particle.
The term “structure of the object” is of optical origin. We put (1) in the
form

$$\nabla^2 \psi(x) + k^2 n^2(x)\, \psi(x) = 0, \qquad n^2(x) = \frac{E - V(x)}{E}, \tag{4}$$


and see that n(x) may be considered as the refractive index of the
scattering medium formed by our microsystem.

Consider the case of weak scattering, when the primary wave is only
slightly altered within the scattering region. In this particularly simple
case we may apply the Born approximation, the essential feature of which
is that ψ(x) in V(x)ψ(x) is replaced by the primary wave undistorted by
scattering, ψ₀(x) = exp(ikz). This allows (1), with ψ taken in the form (2),
to be written as an equation for u with a given right-hand side:

$$\nabla^2 u + k^2 u = \frac{2m}{\hbar^2}\, V(x)\, \psi_0(x). \tag{5}$$

The equation for the Green’s function of this equation is

$$\nabla^2 G + k^2 G = -\delta(x - x'), \tag{6}$$

which has a solution for divergent waves of the form

$$G = \frac{1}{4\pi} \frac{e^{ik|x - x'|}}{|x - x'|}. \tag{7}$$

Hence the solution to the inhomogeneous equation (5) is

$$u(x) = -\frac{1}{4\pi} \frac{2m}{\hbar^2} \int V(x')\, \psi_0(x')\, \frac{e^{ik|x - x'|}}{|x - x'|}\, d^3x', \tag{8}$$

which for |x − x′| → ∞ becomes

$$u(x) = -\frac{e^{ikr}}{r} \frac{1}{4\pi} \frac{2m}{\hbar^2} \int V(x')\, e^{-iqx'}\, d^3x', \tag{9}$$

where q = k′ − k is the change in the wave vector due to the scattering;
|q| = 2k sin(ϑ/2), in which ϑ is the angle between the direction of the
primary wave and that of the scattered wave (Fig. 6). Comparison with
(3) shows that the amplitude of the scattered wave is

$$A(q) = -\frac{1}{4\pi} \frac{2m}{\hbar^2} \int V(x')\, e^{-iqx'}\, d^3x'. \tag{10}$$

This shows that the Born approximation gives the amplitude of the
scattered wave simply as the Fourier component of V(x).
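
For a spherically symmetric potential the angular integration in the Fourier component can be carried out, leaving A(q) = −(2m/ħ²)(1/q)∫₀^∞ V(r) r sin(qr) dr. The sketch below evaluates this radial integral numerically for a screened Coulomb (Yukawa) potential V(r) = V₀ e^{−κr}/r, for which the integral is known in closed form, A(q) = −(2m/ħ²) V₀/(κ² + q²). The potential, the units ħ = m = 1 and all numerical values are assumptions made for illustration only.

```python
import numpy as np

HBAR, MASS = 1.0, 1.0  # assumed units

def born_amplitude(r_times_potential, q, r_max=40.0, n=200001):
    """Born amplitude of a central potential, given the function r * V(r).

    Passing r * V(r) instead of V(r) avoids the division by zero at r = 0
    for potentials that behave like 1/r near the origin.
    """
    r = np.linspace(0.0, r_max, n)
    integrand = r_times_potential(r) * np.sin(q * r)
    dr = r[1] - r[0]
    # Trapezoidal rule written out explicitly.
    integral = dr * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))
    return -(2.0 * MASS / HBAR**2) * integral / q

V0, KAPPA = -1.0, 1.0  # hypothetical strength and screening constant

def yukawa_times_r(r):
    return V0 * np.exp(-KAPPA * r)

q = 1.5
numeric = born_amplitude(yukawa_times_r, q)
exact = -(2.0 * MASS / HBAR**2) * V0 / (KAPPA**2 + q**2)
```

The amplitude falls off with q on the scale κ, i.e. large momentum transfers probe the short-range structure of the potential.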

Let us now consider the deduction of this last function from data on
the scattering amplitude A(q), which we put in the form

$$A(q) = a(q)\, e^{i\alpha(q)},$$

in which

$$a(q) = |A(q)|$$

and α(q) is the phase; we also note that V(x) is taken as being real (no
absorption, only elastic scattering). We then have formally from the
inverse Fourier transform that

$$V(x) = -\frac{\hbar^2}{4\pi^2 m} \int A(q)\, e^{iqx}\, d^3q, \tag{11}$$

in which the integral with respect to the transferred momentum is taken
between infinite limits.

The first restriction on V(x) arises because we know the amplitude
only for |q| ≤ 2k from measurements at probe-particle energies not
exceeding E, with k² = 2mE/ħ², so that we are obliged to replace (11) by
another integral within the limits |q| ≤ 2k. Consequently we obtain in
place of the true V(x) an imprecisely defined representation Ṽ(x):

$$\tilde{V}(x) = -\frac{\hbar^2}{4\pi^2 m} \int_{|q| \leq 2k} A(q)\, e^{iqx}\, d^3q. \tag{12}$$

This expresses the fact that we cannot see details of any object whose
size a is smaller than the wavelength ƛ of the “illumination”.
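
The loss of detail caused by the restriction |q| ≤ 2k may be sketched in one dimension with a discrete Fourier transform. The “object” below, two narrow peaks a distance d apart, and the two values of k are hypothetical; the criterion used for “resolved” (the value midway between the peaks is less than half the value at a peak) is likewise only an illustrative convention.

```python
import numpy as np

x = np.linspace(-20.0, 20.0, 4096, endpoint=False)
dx = x[1] - x[0]
d = 1.0  # separation of the two peaks

# Hypothetical object: two narrow Gaussian peaks at x = +d/2 and x = -d/2.
v = np.exp(-((x - d / 2) / 0.1) ** 2) + np.exp(-((x + d / 2) / 0.1) ** 2)

q = 2.0 * np.pi * np.fft.fftfreq(x.size, d=dx)  # transferred wave numbers
v_hat = np.fft.fft(v)

def band_limited(k):
    """Image rebuilt from the Fourier components with |q| <= 2k only."""
    return np.fft.ifft(np.where(np.abs(q) <= 2.0 * k, v_hat, 0.0)).real

def resolves_two_peaks(k):
    """True if the image dips to less than half its peak value between the peaks."""
    image = band_limited(k)
    centre = image[np.argmin(np.abs(x))]
    peak = image[np.argmin(np.abs(x - d / 2))]
    return centre < 0.5 * peak

low_energy = resolves_two_peaks(0.5)    # wavelength comparable with d
high_energy = resolves_two_peaks(10.0)  # wavelength much smaller than d
```

For 1/k comparable with d the two peaks merge into one broad maximum, whereas for 1/k much smaller than d they are clearly separated.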

The second restriction arises because the phase α(q) cannot be
determined; the transform of (12) shows that this phase happens to be zero
for the case where the interaction function is symmetric with respect to
the inversion x → −x. However, the interaction may contain a part not
symmetric with respect to this transform, in which case α(q) is not zero,
so it is not possible to determine V(x) uniquely from the differential
scattering cross-section alone. The utility of (12) is therefore very much
dependent on the assumptions about the symmetry of the interaction
that may be made before the measurement.
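
The second restriction can be shown in a one-dimensional discrete sketch: an asymmetric interaction function and its mirror image have Fourier components of equal modulus, so that a measurement of a(q) = |A(q)| alone cannot distinguish between them. The function chosen below and the periodic grid are assumptions of the illustration.

```python
import numpy as np

x = np.linspace(-10.0, 10.0, 2048, endpoint=False)

# Hypothetical asymmetric interaction function.
v = np.exp(-((x - 1.0) ** 2)) + 0.5 * np.exp(-4.0 * (x + 2.0) ** 2)

# Mirror image V(-x) sampled on the same periodic grid: index j -> (N - j) mod N.
v_mirror = np.roll(v[::-1], 1)

a = np.abs(np.fft.fft(v))
a_mirror = np.abs(np.fft.fft(v_mirror))

moduli_agree = np.allclose(a, a_mirror)
functions_differ = not np.allclose(v, v_mirror)
transforms_differ = np.max(np.abs(np.fft.fft(v) - np.fft.fft(v_mirror))) > 1e-6
```

The two transforms differ only in the sign of the phase α(q), which is precisely the quantity absent from the differential cross-section.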

Consider now the relation of structure deduction to the measurement
of the wave function for a bound state. We assume that the investigated
micro-object is a heavy ion, or an atom with one electron. The wave
function of this electron within the atom is denoted by φ₀(x). The mean
charge density this produces at a point x (in which x is the coordinate
of the electron relative to the nucleus) is

$$e\rho(x) = e\, |\varphi_0(x)|^2.$$

If the nucleus has a charge +eZ, the potential energy of the second
electron (probe particle) in the field produced by the nucleus and the
first electron is

$$V(x) = -\frac{e^2 Z}{r} + e^2 \int \frac{\rho(x')}{|x - x'|}\, d^3x'. \tag{13}$$

The Fourier transform of this is

$$V(q) = -\frac{4\pi e^2 Z}{q^2} + \frac{4\pi e^2}{q^2}\, \rho(q), \tag{14}$$

in which ρ(q) is the Fourier transform of ρ(x).

If ρ(x) is even with respect to the inversion x → −x, we get from the
above argument that we may determine an approximate representation
of ρ(x):

$$\tilde{\rho}(x) = \frac{1}{(2\pi)^3} \int_{|q| \leq 2k} \rho(q)\, e^{iqx}\, d^3q. \tag{15}$$

If it is also known from considerations of symmetry that φ₀(x) is real,
we get the approximate representation of this function as

$$\tilde{\varphi}_0(x) = \sqrt{\tilde{\rho}(x)}. \tag{16}$$


Thus we see that the wave function is in principle measurable for a 
bound state, at least in certain very simple cases when the indirect 
measurements may be interpreted unambiguously. In every case it is 
first necessary to measure the differential cross-section for elastic scatter- 
ing of probe particles, i.e. we must provide good statistics for the acts 
of scattering, which implies repeated scattering of an electron at the atom 
in the specified initial state. At first sight it might seem that in this series 
of repeated scattering experiments we could use, at least in principle, a 
single atom as the target for our probe particles, but this is not so, for 
the essence of the matter is that we must use a probe beam of wavelength 
ƛ = 1/k ≪ a to study the structure of a target whose dimensions are of the
order of a. The binding energy of a particle in this object is I ≈ ħ²/2ma²,
i.e. the kinetic energy of the probe particle is E = ħ²k²/2m ≫ I, so that the
atom will be ionized by the flux of probe particles and hence must be 
replaced. 

This argument needs some detailed revision when the probe particle
mass m differs from the mass μ of the intra-atomic particle; we still have
E ≫ I if m < μ, but for m > μ we must remember that the Born
approximation becomes unsuitable for v < v₀, in which v is the speed of the
probe particle and v₀ is the velocity of the atomic particle, e.g. of an
electron in an atom. We have v₀ ≈ ħ/aμ roughly, so E ≫ I is replaced by
E ≫ mI/μ. Thus E exceeds the ionization energy of the atom, which is
therefore ionized by irradiation. In the language of photography, we might
say that the atom is too restless a subject to give a good portrait; its face
changes in the most inexplicable fashion during the exposure.
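
The orders of magnitude involved may be checked by a short calculation. The sketch takes the Bohr radius as the size a of the target and an electron as both the bound particle and the probe; these choices, and the requirement ƛ = a/10, are illustrative assumptions. The estimate I ≈ ħ²/2ma² then gives the familiar 13.6 eV, and a probe with ƛ = a/10 carries a hundred times that energy.

```python
HBAR = 1.054571817e-34   # J s
M_E = 9.1093837015e-31   # kg, electron mass
EV = 1.602176634e-19     # J per electron-volt
A_BOHR = 0.529177e-10    # m, taken here as the size a of the target

def binding_energy(a, mass=M_E):
    """Order-of-magnitude binding energy I = hbar^2 / (2 m a^2)."""
    return HBAR**2 / (2.0 * mass * a**2)

def probe_energy(reduced_wavelength, mass=M_E):
    """Kinetic energy E = hbar^2 k^2 / (2 m) with k = 1 / reduced_wavelength."""
    return HBAR**2 / (2.0 * mass * reduced_wavelength**2)

I_ev = binding_energy(A_BOHR) / EV
E_ev = probe_energy(A_BOHR / 10.0) / EV
ratio = E_ev / I_ev  # equals (a / reduced_wavelength)**2
```

Since E/I = (a/ƛ)², any improvement in resolution is paid for by a quadratically larger excess of the probe energy over the binding energy.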

The situation may be described quantitatively as follows. Let a9 be the 
cross-section for elastic collisions and o; that for inelastic ones; then the 
ratio of the number of acts of inelastic scattering N, to the number No 
of elastic ones will be Q;/Q,. A good “‘photograph”’ requires that N;<No, 
but this condition cannot be met because the condition E>J implies that 
the electron in the atom may be considered as almost free in relation to 
the probe particle, since its binding energy J is small relative to the kinetic 
energy of the incident probe particle. Then 0; exceeds 09; for E>I we 
have’ 

E3 ik: 
am’ “9 3° 


so that No/N;=65/0;-0 for E> I. 

This makes it impossible to obtain a sharp image of the atom by 
irradiating a single atom. Hence it is clearly impossible to deduce the 
wave function of the electron in the atom if we probe the same atom, 
when, as we have seen, there is no guarantee that the atom will not be 
ionized during the measurement. 

This does not mean that it is essentially impossible to observe a single 
atom on, say, the stage of an electron microscope. If we restrict ourselves 
to this less ambitious problem of an atom on some stage and neglect any 
possible changes in the atom during inelastic processes, we may (at least 
for fairly heavy atoms) perform several thousand acts of scattering? 
without displacing the atom from its initial position of attachment 
(adsorption). 


og; = 2na7I log : (17) 
CHAPTER X
THE INVERSE PROBLEM IN QUANTUM MECHANICS 


The usual problem in quantum mechanics is that of finding the proper
wave functions $\psi(x, E)$ and proper values $E$ of the Hamilton operator
$\mathcal{H}(x)$ (assumed to be known); this is the direct problem. However, we
may also conceive the inverse problem, in which we have experimental
data on the energy states $E$ and proper functions $\psi(x, E)$ but wish to
find $\mathcal{H}(x)$. The mode of solution has already been presented for the case
where the Born approximation may be used. Several recent mathematical
works¹,²,³,⁴ have presented a method of solving this problem without
demanding the assumption of weak interaction required by the Born
approximation (Chapter IX).

The method may be illustrated by reference to scattering in the S state
for a system whose hamiltonian is

$$\mathcal{H}(x) = -\frac{\hbar^2}{2m}\nabla^2 + U(r). \qquad (1)$$

Denoting the wave number by

$$k = \frac{\sqrt{2mE}}{\hbar}$$

and putting

$$\frac{2m}{\hbar^2}\,U(r) = V(r),$$

we obtain the Schrödinger equation for the S wave $\varphi(r, k)$ as

$$-\frac{d^2\varphi}{dr^2} + V(r)\,\varphi = k^2\varphi, \qquad (2)$$

subject to the boundary conditions

$$\varphi(0, k) = 0, \qquad \frac{d\varphi(0, k)}{dr} = k, \qquad (3)$$

and to the assumption that

$$\int_0^\infty r\,|V(r)|\,dr < \infty$$

(convergence of the integral). The form of $\varphi(r, k)$ for $r \to \infty$ is

$$\varphi(r, k) = A(k)\,\sin(kr + \delta(k)), \qquad (4)$$

in which $A(k)$ is the amplitude of the scattered wave and $\delta(k)$ is the phase;
the latter is observable, because the total cross-section $\sigma$ for the elastic S
scattering is

$$\sigma(k) = \frac{4\pi}{k^2}\,\sin^2\delta(k). \qquad (5)$$
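The link between the phase and the observable cross-section can be made concrete with a small numerical sketch. The Python below is an editorial illustration rather than part of the original derivation. It assumes a hypothetical attractive square well, V(r) = -V0 for r < a, in the reduced units of equation (2), integrates (2) outward from phi(0) = 0, phi'(0) = k with a classical Runge-Kutta step, and reads A(k) and delta(k) off the asymptotic form (4). The names `s_wave`, `cross_section`, `well` and `exact_delta`, the step size and the well parameters are all assumptions made for the example.

```python
import math

def s_wave(V, k, r_max=10.0, h=1e-3):
    """Integrate -phi'' + V(r) phi = k^2 phi from r = 0 with
    phi(0) = 0, phi'(0) = k, using classical RK4.
    Returns (A, delta) read off from phi = A sin(kr + delta)."""
    def rhs(r, phi, dphi):
        return dphi, (V(r) - k * k) * phi

    steps = int(round(r_max / h))
    phi, dphi = 0.0, k
    for i in range(steps):
        r = i * h
        a1, b1 = rhs(r, phi, dphi)
        a2, b2 = rhs(r + h / 2, phi + h / 2 * a1, dphi + h / 2 * b1)
        a3, b3 = rhs(r + h / 2, phi + h / 2 * a2, dphi + h / 2 * b2)
        a4, b4 = rhs(r + h, phi + h * a3, dphi + h * b3)
        phi += h / 6 * (a1 + 2 * a2 + 2 * a3 + a4)
        dphi += h / 6 * (b1 + 2 * b2 + 2 * b3 + b4)
    R = steps * h
    # Outside the range of V: phi = A sin(kR + delta), phi'/k = A cos(kR + delta).
    A = math.hypot(phi, dphi / k)
    delta = (math.atan2(phi, dphi / k) - k * R) % math.pi
    return A, delta

def cross_section(k, delta):
    """Elastic S-wave cross-section, equation (5)."""
    return 4 * math.pi / k**2 * math.sin(delta) ** 2

V0, a = 1.0, 1.0  # assumed well depth and radius (too shallow for a bound state)

def well(r):
    return -V0 if r < a else 0.0

def exact_delta(k):
    """Closed-form square-well phase (mod pi), used only as a check."""
    kappa = math.sqrt(k * k + V0)
    return (math.atan(k / kappa * math.tan(kappa * a)) - k * a) % math.pi

if __name__ == "__main__":
    for k in (0.5, 1.0, 2.0):
        A, d = s_wave(well, k)
        print(k, A, d, cross_section(k, d))
```

The phase is only defined modulo pi here, which is all that (5) requires. The well is kept shallow so that the no-bound-state assumption made in the text holds.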


As regards A(k), we assume for simplicity that the system has no bound 
states (the following relation becomes rather more complicated if these 
are present), and this amplitude may be found from the dispersion relation 


log A(k) = =) aa (6) 


This relation is readily derived if we consider a solution whose behaviour
as $r \to \infty$ is of the following form instead of the $\varphi(r, k)$ with the asymptotic
behaviour of (4):


A(k 
u(r, k)= AO ) gilkr + 5(k)) (4’) 
the boundary condition being 


Oa, wOH=" 3’) 
k=, uw (O,k)=_. 

ul i 2 

We then have that 


2i : 
eri: a 


S(k) = e20 = f*(k) 


7 
7)’ v 
in which $S(k)$ is the scattering matrix for the S wave; (7) implies that
Im log f (k) = 25(k), (8) 


and since f(k) is an analytical function (in the absence of bound states) 
for Imk>0 and 
Re log f (k) = — A(k), 


so that from (8) we apply Cauchy’s theorem to the boundary consisting 
of an infinitely large semicircle and the real axis, evading the point k, and 
hence arrive at (6). 

The solution to (2) that satisfies the conditions of (3) may be put as

$$\varphi(r, k) = \sin kr + \int_0^r K(r, t)\,\sin kt\,dt, \qquad (9)$$

in which the kernel $K(r, t)$ for $0 < t < r$ satisfies

$$\frac{\partial^2 K(r, t)}{\partial r^2} - \frac{\partial^2 K(r, t)}{\partial t^2} = V(r)\,K(r, t) \qquad (10)$$

subject to the conditions

$$K(r, 0) = 0, \qquad K(r, r) = \frac{1}{2}\int_0^r V(z)\,dz, \qquad (11)$$

as may be verified by direct substitution of (9) into (2). Here (11) allows
us to eliminate the potential $V(r)$ from (10):

$$V(r) = 2\,\frac{dK(r, r)}{dr}. \qquad (12)$$
However, we will not do this at present but will set out the solution to 
(10), which is readily obtained by successive approximation: 


(r+0)/2 (r+0)/2  (r—2)/2 
K (r,t) =4 | V(z)dz+4 i du,, i du, V (u, + v,) 
(r—1)/2 ri (r—1)/2 0 


[aes [dena Fue + Opa) | V@)az. (13) 
(8) (0) uy 
This gives us the partial derivative with respect to $r$ as


OK (r, t) OK(r,t) | (r+t 
po eg es Fae (econ 
ér on 4 


(r—1)/2 (rt+t)/2 


ey | anv (S40) | du,-, (14) 
n=1 
0 0 


. JocnaY Gen + beas)on | V(@)de. 
Q vy 


Further we formulate $\partial\varphi(r, k)/\partial r$ from (9) and replace $\partial K(r, t)/\partial r$ by the
series of (14); integrating by parts, we get


ae 


k( vos kr + | K (r, t) cos kat) 
0 
(r+1)/2 


+1 = +! 
+afr(S ) sini +43Y¥ sin mt} | anv (“F+»,)f 
n=1 
0 0 


(rt+t)/2 (r—t)/2 


du,- | diy1V (tyes + taai) | V(@ae. (15) 


dp(r,k) _ 
oro 


Next we consider the expression

$$\left[\frac{1}{k}\,\frac{\partial\varphi(r, k)}{\partial r} + i\varphi(r, k)\right] e^{-ikr}$$

for $r \to \infty$. It can be shown quite simply from (9) and (15) that this limit is

$$1 + \int_0^\infty K_0(u)\,e^{-iku}\,du,$$

in which $K_0(u)$ is the limit of $K(r, t)$ as $r$ and $t$ tend to infinity along the
straight line $r - t = u > 0$. On the other hand, the initial conditions of (3)
and the asymptotic expression of (4) for $\varphi(r, k)$ together imply that this
limit is

$$A(k)\,\exp[i\delta(k)],$$

so that

$$1 + \int_0^\infty K_0(u)\,e^{-iku}\,du = A(k)\,e^{i\delta(k)} = \psi(k). \qquad (16)$$


$K(r, t)$ may be continued with respect to $t$ in the odd way for $t < 0$
($-r < t < 0$), while $\psi(k)$ may be continued for $k < 0$ in a way such that

$$A(k) = A(-k) \quad \text{and} \quad \delta(k) = -\delta(-k).$$


It can be shown? that (10) and (11) together have a unique solution within 
the triangle $-r < t < 0$, $r > 0$ for which

$$K(r, t) = -K(r, -t) \quad \text{and} \quad K(r, t) \to K_0(u)$$

when $r$ and $t$ tend to infinity along the straight line $u = r - t$.

This shows that it is possible to find $K(r, t)$ if $K_0(u)$ is known, and
further to deduce $V(r)$ from (11).
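The last step of the method, passing from the kernel back to the potential, can be sketched numerically. The Python below is an illustration under stated assumptions, not a full inversion: it takes a hypothetical model potential (a Gaussian well in the reduced units of equation (2)), builds the diagonal value K(r, r) = (1/2) ∫ V dz from the boundary condition on the kernel, and then recovers V(r) as twice the derivative of K(r, r) by a central difference. The names `simpson`, `K_diag` and `V_from_kernel` and all numerical parameters are invented for the example. Constructing K(r, t) from measured phases is not attempted here.

```python
import math

def simpson(f, lo, hi, n=200):
    """Composite Simpson rule with an even number n of panels."""
    if hi == lo:
        return 0.0
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for j in range(1, n):
        total += (4 if j % 2 else 2) * f(lo + j * h)
    return total * h / 3

def V(r):
    """Assumed model potential (reduced units of equation (2))."""
    return -0.8 * math.exp(-r * r)

def K_diag(r):
    """Diagonal of the kernel: K(r, r) = (1/2) * integral_0^r V(z) dz."""
    return 0.5 * simpson(V, 0.0, r)

def V_from_kernel(r, eps=1e-4):
    """Recover V(r) = 2 dK(r, r)/dr by a central difference."""
    return 2 * (K_diag(r + eps) - K_diag(r - eps)) / (2 * eps)

if __name__ == "__main__":
    for r in (0.5, 1.0, 2.0):
        print(r, V(r), V_from_kernel(r))
```

In a real application K(r, r) would come from the scattering data rather than from a known V, so the check above only confirms that the final differentiation step is consistent.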

Unfortunately, experience has revealed little practical value in this
mathematically very striking method of structure deduction from particle
scattering in the S state (or in any other state with a definite angular
momentum $l$) by reference to the phase $\delta_l(k)$. Essentially, experiment
gives us $\delta(k)$ only for a finite range $0 < k < k_{\max}$, not for the full range
$0 < k < \infty$, whereas the behaviour of $V(r)$ is very sensitive to the behaviour
of $\delta(k)$ at large $k$. This feature reflects a major law of optics: details in
the object of scale $a$ cannot be seen if the wavelength $\lambda$ of the “light” is
greater than $a$.

We must have the reduced wavelength $\lambda/2\pi$ less than $a$, i.e. $k > 1/a$, in order
to examine the form of $V(r)$ for $r \approx a$, but $\delta(k)$ for elastic scattering becomes
small for large $k$, and the simpler Born method may be applied.

In practice, attempts to examine nucleon interaction potentials encounter
the difficulty that mesons start to be produced and relativistic
effects become prominent for large $k$. The radius of action of nuclear
forces is comparable with the Compton length $a = \hbar/mc$ of the meson
($m$ is the meson's mass and $c$ is the velocity of light). Hence a study of
$V(r)$ as a nucleon interaction potential requires

$$k > \frac{1}{a} = \frac{mc}{\hbar},$$

i.e. that the relative momentum of the nucleons is

$$p = \hbar k > mc.$$


This implies that the nucleon energy is 


2p 
2E = -- > —mc* = 2mc’, 
M 7 
mec 
v>— =e 


and their velocity is $v > mc/M = c/7$. This represents the threshold for
meson production and also the range where relativistic effects become
prominent, and neither of these effects has been incorporated in the
non-relativistic theory presented here.
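The orders of magnitude in this estimate can be checked with a few lines of arithmetic. The sketch below is an editorial illustration: the numerical constants (ħc in MeV fm, the charged-pion and proton rest energies) are standard rounded values supplied here as assumptions, since the text quotes no numbers, and the variable names are arbitrary.

```python
# Rounded physical constants (assumed values, not taken from the text).
HBAR_C = 197.327     # hbar * c in MeV fm
M_PION = 139.57      # charged pion rest energy m c^2 in MeV
M_NUCLEON = 938.27   # proton rest energy M c^2 in MeV

# Compton length of the meson, a = hbar / (m c), in fm.
compton_length_fm = HBAR_C / M_PION

# Wave number needed to resolve r of order a: k > 1/a, in fm^-1.
k_min_per_fm = 1.0 / compton_length_fm

# Non-relativistic nucleon velocity at p = m c: v / c = m / M.
velocity_over_c = M_PION / M_NUCLEON

if __name__ == "__main__":
    print(compton_length_fm, k_min_per_fm, velocity_over_c)
```

With these inputs the Compton length comes out at roughly 1.4 fm and the velocity at roughly one seventh of the velocity of light, in line with the estimate in the text.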


REFERENCES 


1. I. M. Gel’fand and B. M. Levitan, Izv. AN SSSR, ser. matem. 15 (1951) 309.
2. V. K. Mel’nikov, Uspekhi matem. nauk 14 (1959) 121.
3. M. G. Krein, Dokl. Akademii Nauk SSSR, 105, 3 (1955).
4. R. Jost and W. Kohn, Phys. Rev. 87, no. 6 (1952).




CHAPTER XI 


A MEASURING INSTRUMENT IS A
MACROSCOPIC DEVICE


Quantum mechanics resembles classical mechanics in that the microsystems
are assigned various characteristic dynamic variables. Moreover,
the situation may be formulated very crudely by saying that these
variables have analogues in classical mechanics, such as particle
coordinates, momentum, and total energy. There are exceptions in the case
of certain specifically quantum variables, which have no analogues at all
in classical theory, or else ones of very little significance; these include
the particle spin $\sigma$ and the isotopic spin $\tau$. Most dynamic variables retain
their significance in both forms of mechanics, at least in the sense that
they become identical when the phenomena occur in a region where the
two theories overlap. Formally such a region is characterized by the fact
that any observable quantity $L$ may be represented as a power series in
Planck's constant $\hbar$:

$$L = L_0(p, q) + \sum_n \hbar^n a_n(p, q), \qquad (1)$$

in which the first term corresponds to the classical value of $L$.

Some phenomena make this expansion impossible, and here the
dynamic variables split up into complementary sets: space-time ones ($q$)
and momentum-energy ones ($p$). This split into two complementary
classes is a fundamental feature of quantum mechanics and results in an
irremovable uncertainty in quantum ensembles. This may be illustrated
as follows. Suppose that we have a classical ensemble with a random
spread in some mechanical quantity $L$, the variance of this being

$$\overline{\Delta L^2} = \overline{(L - \bar{L})^2} > 0, \qquad (2)$$

the mean represented by the bar being taken over the ensemble and $\bar{L}$
being the mean of $L$. This quantity is measured, and we select specimens
having the same $L$, say equal to $L'$, from which we construct a new
ensemble, in which $\overline{\Delta L^2} = 0$. Speaking generally, this process of selection
has no effect on the state of the system as regards other dynamic variables,
e.g. the variable $M$. Proceeding as above, we produce an ensemble in
which $\overline{\Delta L^2} = 0$, $\overline{\Delta M^2} = 0$, ..., i.e. one in which each of the dynamic
variables has a completely definite value. In particular, such an ensemble
might be built up from classical systems whose temperature is absolute
zero, in which case the momentum $p = 0$, while the coordinate has the
value $q = q_{\min}$ corresponding to minimum potential energy; appropriate
choice of origin allows us to put $q = 0$. All the other quantities are
functions of $p$ and $q$, so they too have definite values corresponding to
$p = 0$ and $q = 0$.

Nothing comparable occurs in the region ruled by quantum mechanics.
A quantum ensemble with a definite value for some mechanical quantity
$L$ ($\overline{\Delta L^2} = 0$) can never be one in which every other dynamic variable also
has a definite value. Let $L$ be represented by the operator $\mathcal{L}$, with some
other mechanical quantity $M$ represented by the operator $\mathcal{M}$; Schwarz's
inequality, which applies¹,² to the linear and self-conjugate operators $\mathcal{L}$
and $\mathcal{M}$, gives us that

$$\overline{\Delta M^2} \cdot \overline{\Delta L^2} \ge \frac{1}{4}\,|\bar{C}|^2, \qquad (3)$$

in which $\mathcal{C} = \mathcal{M}\mathcal{L} - \mathcal{L}\mathcal{M}$ and the bar denotes a mean over the ensemble.
This is the uncertainty principle in its most general form. In particular,
if $M$ and $L$ represent the canonically conjugate momentum $p$ and coordinate
$q$, and we have for the operators $\mathcal{P}$ and $\mathcal{Q}$ that

$$\mathcal{P}\mathcal{Q} - \mathcal{Q}\mathcal{P} = -i\hbar, \qquad (4)$$

then (3) implies that

$$\overline{\Delta p^2} \cdot \overline{\Delta q^2} \ge \frac{\hbar^2}{4}, \qquad (5)$$

i.e. the uncertainty relation for $p$ and $q$. Also, (3) shows that only the
quantities represented by commuting operators,

$$\mathcal{M}\mathcal{L} - \mathcal{L}\mathcal{M} = 0, \qquad (6)$$

have unrelated variances $\overline{\Delta M^2}$ and $\overline{\Delta L^2}$, so that specimens of
microsystems may be selected from an ensemble in such a way that the new
ensemble has both variances zero. Quantities represented by non-commuting
operators,

$$\mathcal{M}\mathcal{L} - \mathcal{L}\mathcal{M} \neq 0, \qquad (7)$$

cannot give an ensemble with $\overline{\Delta M^2} = 0$ and $\overline{\Delta L^2} = 0$, apart from exceptional
states of the initial ensemble such that $\bar{C} = 0$. Hence an ensemble with a
definite value for some dynamic variable $L$ represented by operator $\mathcal{L}$
always allows us to find dynamic variables $M$ represented by an operator $\mathcal{M}$
that does not commute with $\mathcal{L}$. This means that $\overline{\Delta M^2} \neq 0$ in this
ensemble, in accordance with the general uncertainty relation (3). An
attempt to eliminate the variance of $M$ by selecting specimens having a
definite value $M'$ will give rise to a new random set in which, by virtue
of (3), there will be a spread in $L$ ($\overline{\Delta L^2}$ no longer zero). Randomness
cannot be eliminated in the quantum region. This feature gives rise to an
extremely basic requirement in respect to the working principle of any 
measuring instrument designed for measurements on a quantum ensemble. 
Only external measuring instruments can serve to detect ensembles and 
to analyse distributions in them (such instruments must stand outside the 
ensembles) in any study of the phenomena by statistical methods; they 
must be free from the elements of randomness inherent in the statistical 
sets examined by their use. However, a measuring instrument resembles 
any other body in consisting of atoms, molecules, and similar micro- 
formations, which perform random chaotic movements, as observed in 
Brownian motion. Before the era of quantum mechanics it was assumed 
that this random molecular motion could be frozen, but we know that 
this motion does not stop even at absolute zero; the random fluctuations 
in the positions of the microparticles persist even in an absolutely cold 
body. The world of microparticles shows too lively disordered motion 
for any part of it to be used as a measuring instrument; such a micro- 
instrument would be permanently subject to numerous random dis- 
turbances and so itself would need to be checked continually. We need 
to find an island of peace in the chaos of the microworld, in the stormy 
sea of microphenomena. Quantum mechanics finds this island in a 
macroscopic instrument. 
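The general uncertainty relation (3) discussed above can be checked on the smallest possible example. The sketch below is an editorial illustration, assuming a spin-1/2 system with the Pauli matrices standing in for the operators (the text mentions spin only in passing and does not work this case). The helper names `matmul`, `mean`, `variance`, `commutator` and `state` are invented for the example. It evaluates both sides of the inequality for a few pure states, and also shows that an ensemble with a definite value of one spin component has a non-zero variance in another.

```python
import cmath
import math

# Pauli matrices as 2x2 tuples (an assumed example system).
SX = ((0, 1), (1, 0))
SY = ((0, -1j), (1j, 0))
SZ = ((1, 0), (0, -1))

def matmul(a, b):
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def mean(op, psi):
    """Ensemble mean <psi|op|psi> for a normalized two-component state."""
    return sum(
        psi[i].conjugate() * op[i][j] * psi[j]
        for i in range(2) for j in range(2)
    )

def variance(op, psi):
    """Mean square deviation of a self-conjugate operator."""
    return (mean(matmul(op, op), psi) - mean(op, psi) ** 2).real

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return tuple(tuple(ab[i][j] - ba[i][j] for j in range(2)) for i in range(2))

def state(theta, phi):
    """Pure spin state pointing along polar angles (theta, phi)."""
    return (complex(math.cos(theta / 2)),
            cmath.exp(1j * phi) * math.sin(theta / 2))

if __name__ == "__main__":
    for theta, phi in ((0.3, 0.7), (1.0, 2.5), (2.0, 0.0)):
        psi = state(theta, phi)
        lhs = variance(SX, psi) * variance(SY, psi)
        rhs = 0.25 * abs(mean(commutator(SX, SY), psi)) ** 2
        print(theta, phi, lhs, rhs)
```

For the state with a definite z-component (theta = 0) the variance of SZ vanishes while that of SX does not, which is the two-level analogue of the statement that selection cannot remove randomness from every variable at once.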

The output of any instrument is always a macroscopic effect: deflection 
of a pointer in a meter, formation of droplets in a Wilson cloud chamber, 
blackening of grains in a photographic emulsion, and so on. Quantum 
theory identifies this concept of macroscopicity with that of a classical 
nature in the instrument; in other words, the instrument must be so 
constructed that its action ultimately depends solely on its macroscopic 
features, i.e. ones in which Planck’s constant plays no part. Such an 
instrument is³ maximally free from quantum randomness, so we may
correctly say that quantum mechanics examines the microworld in its
relation to the macroworld. Macroscopic (classical) instruments are 
recording systems in relation to which the state of the microsystems is 
determined in quantum mechanics. 

Actually, such a recording system may be realized only approximately, 
but in this respect there is no difference from the classical position. For 
instance, in classical mechanics we would say “The deductions are 
strictly correct only for measurements with absolutely rigid measures 
and uniformly running clocks’’. If these do not in fact exist, we find some 
devices approximating to them such that the theoretical deductions still 
apply to the real situations. 

Now let us make a few comments on the position of man himself as a 
measuring instrument. There is no doubt that here we enter very tricky
ground, since in the current state of our knowledge we cannot
analyse man with the clarity and completeness possible for a physical 
instrument. For example, we cannot say in respect of living matter in 
which cases we can ignore Planck’s constant and in which we cannot. 
We must therefore resort to less precise evidence and criteria. 

A major feature is that the sense organs respond in a smoothed fashion 
to microevents, i.e. in a macroscopic way. For example, we sense heat or 
cold, the sensation being produced by microprocesses (random motion 
of the molecules in the surrounding medium) but with the fluctuations 
averaged out. The impacts of the individual molecules are not sensed, 
and we cannot judge the fate of single molecules. The same may be said 
of the perception of light; even under the limiting conditions of Vavilov’s 
experiments⁴, the eye merely establishes the fact that the intensity
fluctuates at extremely low levels. Here also the final process within the 
man is of the ordered macroscopic type; the observer makes a note of the 
observed fluctuation, and in this respect he does not differ from a particle 
counter that moves its pointer. We are not even able to say how suitably 
equipped living matter would sense the direct motion of microparticles 
that it was capable of perceiving. The consciousness of such a being 
would perform some kind of Brownian motion. Should we envy such a 
capacity or regret such a misfortune? No reliable answer can be given 
even to this question. 

The chaos of microphenomena appears as disproportionately large 
fluctuations in the chain reaction in a nuclear reactor brought into 
operation at very low mean power; such fluctuations greatly exceed the
mean level. This reflection of atomic randomness can well get on the 
nerves of operators participating in the startup, since it may seem that 
the process will get out of control. However, these disturbing fluctuations 
are smoothed out when the power level becomes high. We cannot escape 
the impression that human perception is better adapted to the smooth 
course of events in the macroscopic world. 

Thus although we cannot apply to man the simple criterion of macro- 
scopicity used in quantum mechanics, we can say that human perception 
smooths out the atomicity of the world and in this respect is macroscopic. 

Man must be considered as a macroscopic instrument from the view- 
point of physics, but he is far from being perfect in that respect, especially 
because he needs an extension of the facilities provided by his senses in 
order to study the microworld. The extension is provided by measuring 
instruments, which allow microphenomena to speak in the language of 
macrophenomena. This has given rise to the frequently debated question 
of how far this approach “restricts” our perception of the microworld. 

Whilst this aspect will not be considered here, because it deals with 
epistemology rather than physics, it is difficult to avoid a few comments. 

It is an old and well known truth that the criterion of the reality of 
perception is experience. However, the detailed mechanism of perception 
may vary greatly from one living being to another. This mechanism 
possesses certain features common to all beings, even though it may 
appear paradoxical from the viewpoint of man. These common features 
may perhaps be best expressed in the language of cybernetics: conscious- 
ness (general knowledge) gives a command to an effector organ of the 
living matter (or to the machines or apparatus at its disposal), the effector 
produces an action, and after this action the consciousness receives a 
return signal, communicating to it the result of the action. Feedback of 
this kind stimulates the consciousness to further action. Perception of the 
external world is based on this feedback, whose precise form may vary 
greatly. It is clear that if the feedback operates incorrectly (does not
reflect reality), then no organism, nor any whole group saddled with this
deficiency, could exist for any prolonged period. However, the mechanisms
of “reflection” may vary greatly. For example, in the devices enabling 
man to see at night infrared is converted into visible light; the colour of an 
object may be transmitted as a colour signal (by a photochemical action 
in the eye), or as a sound if the light is converted to sound by a photocell
and then passed to the organs of hearing. 

Iolanthe, blind from birth, would have found a way of distinguishing 
a red rose from a white one if she had been a physicist. 

These examples show that the important thing is not the precise method 
of perceiving the signal received from the object but that the signal 
should be unambiguous. 

For this reason we should not belittle quantum mechanics simply 
because it leads us to hear the music of the microworld in the language of 
macroscopic effects, especially since experience in applied physics and in 
atomic physics research shows that this language is never an obstacle to 
scientific and technical progress. Could man even perceive the world in a 
way different from that in which he does at present (i.e. as an essentially 
macroscopic world) without essential alteration of his organism? 
CHAPTER XII
SCHEME OF A MACROSCOPIC INSTRUMENT 


Here we return to the theory, especially as regards the meaning of
measurement from the theoretical point of view. We assume that we have
to measure the dynamic variable $L$ associated with a microsystem $\mu$,
which itself belongs to a quantum ensemble governed by a macrosetting
$M$. For simplicity, the ensemble is assumed pure, so that the state of the
microsystem is described by the wave function $\psi_M(q)$. The variable $L$ is
represented by the operator $\mathcal{L}$ and has the proper values $L_1, L_2, L_3,
\ldots, L_n, \ldots$ together with the corresponding proper functions $\psi_1(q), \psi_2(q),
\ldots, \psi_n(q), \ldots$. Then $\psi_M(q)$ may be represented as a spectral expansion with
respect to the proper functions of $\mathcal{L}$:

$$\psi_M(q) = \sum_n C_n\,\psi_n(q). \qquad (1)$$

The set of coefficients $C_1, C_2, \ldots, C_n, \ldots$ should be considered as $\psi_M$ taken in
the $L$ representation, i.e. in the frame of reference of the dynamic variable
$L$. Here it is in order to use the notation

$$C_n = \psi_M(L_n), \qquad (2)$$

which stresses this feature explicitly, but we shall employ the simpler $C_n$
notation, with the subscript $n$ having two meanings: to denote the proper
value $L_n$ of $L$ and to denote the macrosetting that produces this value.

We assume that the measurement has been performed and that we have
found that $L$ has the value $L_m$; after this measurement the system will
then belong to a new ensemble, in which $L$ has only the value $L_m$, i.e.
$\overline{\Delta L^2} = 0$ (if $\overline{\Delta L^2} \neq 0$ before the measurement). The corresponding state of
the system is now $\psi_m(q)$ and will relate to the new macrosetting resulting
from the measurement.

Comparison of the new state $\psi_m(q)$ with the starting $\psi_M(q)$ shows that,
in mathematical terms, the spectrum of (1) has been reduced to the single
function $\psi_m(q)$:

$$\sum_n C_n\,\psi_n(q) \to \psi_m(q). \qquad (3)$$

This process is termed contraction of the wave packet or contraction
of the wave function. The coefficients $C_n$ in the expansion determine the
intensities $|C_n|^2$ of the particular states $\psi_n(q)$ in the superposition of (1)
and hence determine the probability that a measurement of $L$ will give the
value $L_m$, i.e. that measurement of $L$ causes $\psi_M(q)$ to contract to $\psi_m(q)$
(see any textbook on quantum mechanics).
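The formal rule just stated can be sketched as a small simulation. The Python below is an editorial illustration with invented names (`measure`, `coeffs`): it treats the list of coefficients C_n as the state in the L representation, draws an outcome n with probability proportional to |C_n|^2, and returns the contracted state, in which only that one coefficient survives. It models the bookkeeping of the contraction only, not the physical action of any instrument.

```python
import random

def measure(coeffs, rng):
    """Draw outcome n with probability |C_n|^2 / sum |C|^2 and return
    (n, contracted), where contracted has a single non-zero coefficient."""
    weights = [abs(c) ** 2 for c in coeffs]
    n = rng.choices(range(len(coeffs)), weights=weights, k=1)[0]
    contracted = [1.0 if i == n else 0.0 for i in range(len(coeffs))]
    return n, contracted

if __name__ == "__main__":
    rng = random.Random(0)          # fixed seed so the run is repeatable
    coeffs = [0.6, 0.8j]            # assumed example: |C_1|^2 = 0.36, |C_2|^2 = 0.64
    shots = 20000
    first = sum(1 for _ in range(shots) if measure(coeffs, rng)[0] == 0)
    print("relative frequency of the first outcome:", first / shots)

    n, contracted = measure(coeffs, rng)
    repeats = [measure(contracted, rng)[0] for _ in range(5)]
    print("outcome", n, "repeated measurements", repeats)
```

Repeating the measurement on the contracted state always returns the same outcome, which corresponds to the new ensemble having zero variance in L.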

This is the purely formal side of the matter; let us now consider some 
examples showing how the process occurs under actual conditions. 


A. ANALYSIS OF A POLARIZED BEAM 


Consider a light source S (Figure 7) working into a collimator C and a
polarizer P, the result being a polarized beam propagating along the Ox
axis. These parts of the macrosystem may be considered as preparing the
microsystem (here a light quantum) in a definite pure state $\psi_M$, in which
$M$ represents the macrosetting (including the light source, collimator,
polarizer, possible diaphragms, and so on; in particular, it includes the
fact that the path of the beam contains no other microscopic or macroscopic
bodies that could affect the propagation). The state $\psi_M$ of the

Fig. 7. Experiment with a polarized light beam. S is a light source, C is a collimator,
and P is a polarizer, the last two together forming the preparative part of the
equipment. This gives the ensemble $\psi_M$. A is an analyser which produces the two beams
$\psi_1$ and $\psi_2$, while $\phi_1$ and $\phi_2$ are photon detectors. Thus S + C + P = M,
A + $\phi_1$ + $\phi_2$ = J (instrument).
linearly polarized light may be represented as the superposition of two
mutually perpendicular states of polarization, say $\psi_1$ and $\psi_2$:

$$\psi_M = C_1\psi_1 + C_2\psi_2, \qquad (4)$$

in which $C_1 = \cos\vartheta$ and $C_2 = \sin\vartheta$, with $\vartheta$ being the angle between the
polarization direction of the beam $M$ and the polarization direction in
state 1. The spectral expansion of (4) is effected in practice by placing at
some distance from P an analyser A extracting light beams having states
of polarization 1 and 2 lying in different directions. These beams fall on
the detectors $\phi_1$ and $\phi_2$, which respond to the arrival of a light quantum
in one of the two possible directions, $\psi_1$ or $\psi_2$. The detector may be a
photomultiplier, which is in principle capable of recording a single photon
to give rise to an electron avalanche (a macroscopically recorded electrical
pulse). The channel in which the pulse occurs will show whether the
photon had a polarization of 1 or 2. The numbers $N_1$ and $N_2$ of the
photons in the two channels are given by (4) as follows for the case of
measurements repeated many times:

$$\frac{N_1}{N_2} = \cot^2\vartheta. \qquad (5)$$

This system is termed a measuring instrument; it has two major parts,
whose purposes differ: the analyser, which performs the spectral
decomposition of (4), and the detectors $\phi_1$ and $\phi_2$ (here photomultipliers).
It is clear that the fluxes $\psi_1$ and $\psi_2$ must be laterally bounded in order to
allow the detectors to function properly, otherwise $\phi_1$ and $\phi_2$ would
confuse the assignment of a quantum to a particular beam.

It is clear that the entire device forms part of the macrosetting; the
possibility of dividing the latter into a “preparative” part (usually denoted
by M) and a “measuring” part is due to the spatial separation of M from
A + $\phi_1$ + $\phi_2$.
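The counting statistics of the two channels can be imitated with a short simulation. The sketch below is an editorial illustration and assumes ideal detectors: each photon prepared at angle theta to polarization state 1 fires detector 1 with probability cos^2(theta) and detector 2 otherwise. The function name `analyser_counts`, the seed and the number of shots are arbitrary choices.

```python
import math
import random

def analyser_counts(theta, shots, seed=0):
    """Return (N1, N2) for `shots` photons polarized at angle theta (radians)
    to state 1; each photon is registered in exactly one channel."""
    rng = random.Random(seed)
    p1 = math.cos(theta) ** 2          # |C_1|^2
    n1 = sum(1 for _ in range(shots) if rng.random() < p1)
    return n1, shots - n1

if __name__ == "__main__":
    theta = math.radians(30.0)
    n1, n2 = analyser_counts(theta, 200000)
    print("N1/N2 =", n1 / n2, " cot^2(theta) =", 1.0 / math.tan(theta) ** 2)
```

For a large number of repetitions the ratio of counts approaches the square of the cotangent of the angle, while any single photon still gives a pulse in only one channel.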


B. MOMENTUM MEASUREMENT 
Figure 8 shows schematically another experiment, in which we have a
particle source S (the particles may be exposed to external electromagnetic
fields $\varepsilon$) and collimating diaphragms C, which collimate the
beam of free particles leaving the region of action of $\varepsilon$.
Fig. 8. Momentum measurement with a diffraction grating D; S is a particle source,
and C is the diaphragm. The preparative part is M = S + C, while D splits $\psi_M$ into
a spectrum $\psi_1, \psi_2, \psi_3, \ldots$; $\phi_1, \phi_2, \phi_3, \ldots$ are particle detectors. J = D + $\phi_1$ + $\phi_2$ + ....


At the exit from the collimator we have a transversely bounded electron
beam with generally undefined momentum $p$. There are two possible
causes for this uncertainty: the source may not provide particles of
identical momentum (in which case we have a mixed ensemble; we assume
that this cause is unimportant, the resulting spread in $p$ thus being small),
and the particles are exposed to the external varying fields $\varepsilon$. In the latter
case the ensemble may be pure but may have a spread in momentum.
This is the case that we shall consider, on the assumption that the pure
ensemble is described by the wave function $\psi_M(x, y)$, which is resolved
spectrally with respect to the proper functions of the momentum operator

$$\psi_p(x) = \frac{\exp(ipx/\hbar)}{(2\pi\hbar)^{1/2}}$$

as follows:

$$\psi_M(x, y) = \sum_p a_p(y)\,\psi_p(x), \qquad (6)$$

the lateral restriction of the beam being incorporated by assuming that
the coefficients $a_p(y)$ are slowly varying functions of the $y$-coordinate
and vanish for $|y| > d/2$, in which $d$ is the beam width. This whole part of
the macrosetting may be considered as the “preparatory” part, which yields
the starting ensemble $\psi_M$.

The beam falls on the diffraction grating D, which produces a spatial
separation of beams differing in momentum, i.e. it resolves the primary
wave $\psi_M(x, y)$ into a spectrum.

This grating also alters the beam direction and, in essence, alters the
initial ensemble: the initial state $\psi_M(x, y)$ is converted by D into a mixture
of the primary wave $\psi_M(x, y)$ and scattered waves $b_n(x, y)\,\psi_{p_n}(x, y)$,
which are shown as separate beams $\psi_n(x, y)$ in the figure:

$$\psi_M(x, y) \to \psi_{M+D}(x, y) = \psi_M(x, y) + \sum_n b_n(x, y)\,\psi_{p_n}(x, y), \qquad (7)$$

in which the sum is taken with respect to the beams $\psi_n(x, y)$ as scattered
by the grating. The spatial separation of the beams causes the coefficients
for the various $p_n$ to satisfy

$$b_n^*(x, y)\,b_{n'}(x, y) = 0 \quad (n \neq n'). \qquad (8)$$

This spectral decomposition may be performed far from the “preparative”
section, in which case the macrosetting may be divided into a “preparative”
part and an “instrument” (here the grating plus detectors $\phi_1, \phi_2, \ldots$)
that detects the particles in a given beam after scattering.

The localization of a particle in a particular detector allows us to
evaluate the momentum before the grating interfered with it, i.e. before
“the instrument affected the state of the object”. The detector in this
example may be a counter in which an electron triggers an electrical
discharge, i.e. one which produces a macroscopic effect. Alternatively, the
detector might be a photographic plate, where the electron initiates a
chemical reaction in a grain of the emulsion and so causes the macroscopic
effect known as a latent image.

This example shows that a diffraction grating that produces spatial 
separation of beams differing in momentum also destroys the interference 
between these, while a particle that is led to trigger a detector in a 
particular channel indicates its former momentum (i.e. its momentum 
before scattering). The physical essence of the contraction of the wave 
packet is thus that the microparticle alters the state of the macroscopic 
instrument. 
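The statement that spatial separation destroys the interference between beams of different momentum can be illustrated numerically. The sketch below is an editorial illustration under simplifying assumptions: two plane waves in x with top-hat envelopes in y (the function `box`), unit ħ, and the normalization factor of the plane waves dropped. All names and numbers are invented for the example. When the envelopes do not overlap, the summed intensity does not depend on x, so the cross term vanishes; when they coincide, the intensity oscillates with x.

```python
import cmath

def box(lo, hi):
    """Top-hat beam envelope b(y): 1 inside [lo, hi), 0 outside."""
    return lambda y: 1.0 if lo <= y < hi else 0.0

def total_intensity(b1, b2, p1, p2, x, ys, hbar=1.0):
    """Sum over the sample points ys of |b1(y) e^{i p1 x} + b2(y) e^{i p2 x}|^2."""
    w1 = cmath.exp(1j * p1 * x / hbar)
    w2 = cmath.exp(1j * p2 * x / hbar)
    return sum(abs(b1(y) * w1 + b2(y) * w2) ** 2 for y in ys)

YS = [i * 0.01 for i in range(-100, 400)]   # sample points across both beams

if __name__ == "__main__":
    for x in (0.0, 1.3, 2.6):
        separated = total_intensity(box(0, 1), box(2, 3), 1.0, 2.0, x, YS)
        overlapping = total_intensity(box(0, 1), box(0, 1), 1.0, 2.0, x, YS)
        print(x, separated, overlapping)
```

In the separated case the result is simply the sum of the two beam intensities, which is the condition expressed by the vanishing products of the coefficients for different beams.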
C. DETERMINATION OF THE QUANTUM STATE OF AN ATOM


Consider a beam of atoms emerging along the Ox axis from the “black
box” S (Figure 9). Neglect the detailed structure of the box and merely
note that the width of the beam $\Delta Z$ produced by the collimator diaphragm
D is such that the momentum $p_z$ along the Oz axis (perpendicular
to the Ox axis) has the following uncertainty:

$$\Delta p_z \ge \frac{\hbar}{2\,\Delta Z}. \qquad (9)$$

The momentum along Ox may be taken as being defined with unlimited
precision as $p_x = Mv$, in which $M$ is the mass of the atom, which will be
taken as fairly large. We also shall assume that the atoms escaping from
the black box may differ in their internal states, which are described by
the wave functions $\psi_n(x)$, where $x$ indicates the internal coordinates of
the atom, e.g. the coordinates of its electrons. The energies of these states
are $E_n$. Moreover, the electric or magnetic moment of the atom may vary
with the state, in which case an inhomogeneous electric or magnetic field
allows us to divide the beam into beams in each of which the atoms are

Fig. 9. Measurement of the internal energy of an atom (Stern-Gerlach experiment).
S is a “black box”, D is a diaphragm, M = S + D prepares $\psi_M$. NS is a magnet (the
analyser), P is a plate (detector), J = NS + P. $\phi_1, \phi_2, \phi_3, \ldots$ are the separated beams.
in some definite internal state $\psi_n(x)$. This division occurs for example
in the Stern-Gerlach experiment on the magnetic moment of the atom
(here we note that what is measured is²,³ actually the energy of the atom
in the external magnetic field $H$). The internal energy $E_n$ of the atom
becomes dependent on the position in space when the atom enters this
external inhomogeneous field:

$$E_n \to E_n(X, Y, Z), \qquad (10)$$

in which $X, Y, Z$ denote the coordinates of the centre of gravity of the
atom; in particular, since the states $n$ have different magnetic moments
$\mathcal{M}_n$, we have

$$E_n(X, Y, Z) = E_n - \mathcal{M}_n H, \qquad (11)$$

in which $\mathcal{M}_n$ is the projection of the atomic magnetic moment on the
direction of $H$, the field being a function of $X, Y, Z$. The problem may be made
more concrete by assuming the field to be directed along the Oz axis and
dependent on $Z$. The projection $\mathcal{M}_n$ equals the Bohr magneton $e\hbar/2\mu c$
multiplied by some numerical factor $g$, which depends on the structure of
the atom (the Landé factor). A small $H$ does not alter the internal
structure of the atom, so the wave functions $\psi_n(x)$ in this approximation
may be taken as independent of the position of the atom in space, i.e. of
$X, Y, Z$. The wave function $\Psi_M(X, Y, Z, x)$ for the atom as a whole depends
on the internal coordinates $x$ and on the coordinates $X, Y, Z$ of the centre of
gravity of the atom; since the functions $\psi_n(x)$ form a complete set, it may
be expanded as a series in $\psi_n(x)$:

$$\Psi_M(X, Y, Z, x) = \sum_n \Psi_n(X, Y, Z)\,\psi_n(x). \qquad (12)$$

By representing the state of the atoms in our ensemble as a superposition
of the states $\psi_n(x)$ we explicitly assume that the “black box”
prepares a pure ensemble $\Psi_M$, but with the energy $E_n$ uncertain ($\overline{\Delta E^2} \neq 0$).
This “black box” is the preparative part of the macrosetting. The
measuring instrument consists of the magnet NS, which produces the
inhomogeneous field $H$ (itself dependent on $Z$) along the OZ axis,
together with the detectors $\phi_1, \phi_2, \ldots$; as we shall see later, the atom
itself here forms part of the measuring instrument. $\Psi_M(X, Y, Z, x)$ must
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satisfy Schrédinger’s equation: 
ih iad i 13 
ih— = : 
1 (13) 
KH (X,Y,Z,x) =T (X, Y,Z) + Ho(x, X,Y,Z), 


in which #9(x, X, Y, Z) is the hamiltonian describing the internal motion 
of the atom. In the presence of an external field we have 


H o(x,X,Y,Z) Wax) = E,(X, Y,Z)-W,(x); (14) 


in which we have used the assumption that the external field does not 
deform the atom, i.e. that the w,(x) remain unchanged. Further, 
T(X, Y, Z) is the hamiltonian for the motion of the centre of gravity; 
there is no interaction between the atom and the external field apart from 
that due to the magnetic (or electric) moment of the atom, so T(X, Y, Z) 
is simply the hamiltonian for the kinetic energy of the atom as a whole. 

Substituting (12) into (13), multiplying the result by one of the W*(x) 
functions, and integrating with respect to the internal variables (x), we 
obtain from the condition that the functions are orthogonal, 


| VRC) val) bx = Fyn (15) 
a system of equations for ¥,,(X, Y, Z): 
ov 


Here the energy $E_m(X, Y, Z)$ of the atom plays the part of the potential 
energy for the motion of the atom as a whole. H is macroscopic in 
character, so it changes smoothly on the scale of the wavelength $\lambda = \hbar/Mv$ 
of the atom: $\lambda\, \partial H/\partial Z \ll H$. This condition is simply the condition of 
applicability of classical mechanics to the motion of the atom as a whole. 
Therefore we put $\Psi_m(X, Y, Z)$ in the form 

$$\Psi_m(X, Y, Z) = \sqrt{\rho_m(X, Y, Z)}\, \exp\left[-\frac{i}{\hbar} S_m(X, Y, Z)\right], \tag{17}$$

in which $S_m(X, Y, Z)$ is the classical action function, while $\rho_m(X, Y, Z)$ 
is the density of the atoms in (X, Y, Z) space. We substitute (17) into (16) 
and neglect higher powers of $\hbar$; separating the real and imaginary parts, 
we have 

$$\frac{\partial S_m}{\partial t} = \frac{1}{2M} (\nabla S_m)^2 + E_m(X, Y, Z), \tag{18}$$

$$\frac{\partial \rho_m}{\partial t} - \frac{1}{M} \nabla \cdot (\rho_m \nabla S_m) = 0.$$

The first of these is the Hamilton-Jacobi equation for the action 
function $S_m$, while the second is the equation of continuity for the density 
$\rho_m$ in beam m; $-(1/M)\nabla S_m$ is simply the velocity $v_m$ of the atoms in the 
beam; thus this equation states that the particles will move in such a way 
that their flux is constant in any cross-section of the tube formed by the 
trajectories of the particles. We do not need to solve these equations; it is 
simpler, in view of the classical character of the motion, to use Newton’s 
equations directly: 

$$M \frac{d^2 X}{dt^2} = 0, \qquad M \frac{d^2 Y}{dt^2} = 0, \qquad M \frac{d^2 Z}{dt^2} = -\frac{\partial E_m}{\partial Z}. \tag{19}$$

These equations give 

$$X_m = vt + X_0, \qquad Y_m = Y_0, \qquad Z_m = -\frac{1}{2M} \frac{\partial E_m}{\partial Z}\, t^2 + Z_0, \tag{19'}$$


in which $X_0, Y_0, Z_0$ are the initial values of the coordinates X, Y, Z; v is 
the velocity of the atom along the OX axis. It must be remembered that 
this solution is approximate, for the atoms that pass through D will not 
move along the classical loci; the beam will spread out. This quantum 
effect may be allowed for by taking a further step in the approximate 
solution of (16) by incorporating terms containing $\hbar$ to the first power. 
We shall not do this here, but merely limit ourselves to estimates. The 
uncertainty relation (9) implies that the beam width will increase in the 
OZ direction. 
Thus, from (9) it follows that 

$$\Delta v_z > \frac{\hbar}{2M\, \Delta Z_0},$$

and so that 

$$\Delta Z_t = \Delta v_z\, t > \frac{\hbar}{2M\, \Delta Z_0}\, t.$$

Beams relating to different internal states n and m will be separated 
provided that 

$$\left| \frac{\partial E_n}{\partial Z} - \frac{\partial E_m}{\partial Z} \right| \frac{t^2}{2M} > \frac{\hbar t}{2M\, \Delta Z_0},$$

that is, 

$$\left| \frac{\partial E_n}{\partial Z} - \frac{\partial E_m}{\partial Z} \right| \Delta Z_0 \cdot t > \hbar. \tag{20}$$

Since $E_n$ varies only slightly with Z, we may put 

$$|E_n - E_m|\, t > \hbar, \tag{21}$$


which is the usual relation between the energy E of the state under 
measurement and the duration ¢ of that measurement; this relation should 
not* be confused with the uncertainty relation for p and q. 
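
The separation criterion can be checked numerically. The following is a minimal sketch, assuming the classical deflection of (19') and the spreading estimate that precedes (20); the function names and the silver-like numbers in the usage note are illustrative assumptions, not values taken from the text.

```python
HBAR = 1.054571817e-34  # reduced Planck constant, J*s


def deflection(dE_dZ, mass, t):
    """Classical Z-displacement after time t, from (19'): -(dE/dZ) t^2 / (2M)."""
    return -dE_dZ * t ** 2 / (2.0 * mass)


def quantum_spread(delta_z0, mass, t):
    """Lower bound on the beam broadening: hbar * t / (2 M dZ0)."""
    return HBAR * t / (2.0 * mass * delta_z0)


def beams_separated(dE_dZ_n, dE_dZ_m, delta_z0, mass, t):
    """True when the gap between beams n and m exceeds the quantum spreading."""
    gap = abs(deflection(dE_dZ_n, mass, t) - deflection(dE_dZ_m, mass, t))
    return gap > quantum_spread(delta_z0, mass, t)
```

With hypothetical values (mass 1.79e-25 kg, energy gradients of plus and minus 9.27e-21 J/m, slit width 1e-5 m) the beams separate for a flight time of 1e-4 s but not for 1e-12 s, since the gap grows as t squared while the spreading grows only as t.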

Consider now the operation of the detectors $\Phi_1, \Phi_2, \ldots$ receiving the 
beams. 

In what follows it is convenient to work with functions for the motion 
of the centre of gravity that are normalized to 1 throughout the 
(X, Y, Z) space; for this purpose it is sufficient to put 

$$\Psi_n(X, Y, Z) = C_n\, \varphi_n(X, Y, Z),$$

with the $\varphi_n$ normalized to 1, whereupon the superposition of (12) becomes 

$$\Psi_M(X, Y, Z, x) = \sum_n C_n\, \varphi_n(X, Y, Z)\, \psi_n(x). \tag{22}$$

This implies that the probability that the atom lies near the point X, Y, Z 
is 

$$W(X, Y, Z)\, dX\, dY\, dZ = dX\, dY\, dZ \int |\Psi_M(X, Y, Z, x)|^2\, dx = \sum_n |C_n|^2\, |\varphi_n(X, Y, Z)|^2\, dX\, dY\, dZ. \tag{23}$$

We shall consider now the probability that the atom is in beam m, 
which is equivalent to its having an energy $E_m$; (23) shows that this 
probability is the integral of W(X, Y, Z) over the volume $V_m$ of beam m. 
If the beams for the various m do not overlap, which is possible if (21) is 
obeyed, the integral is 

$$W_m = \sum_n |C_n|^2 \int_{V_m} |\varphi_n(X, Y, Z)|^2\, dX\, dY\, dZ = |C_m|^2, \tag{24}$$

in complete agreement with the interpretation of the coefficients $C_m$ as 
indicating the relative participation of the states $\psi_m$ in the initial ensemble 
$\Psi_M$ of (22). 
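
The statement of (24) can be illustrated with a one-dimensional toy model. This is only a sketch under stated assumptions: the beams are taken as normalized Gaussian densities of common width along Z, and the helper names `gaussian_mass` and `beam_probabilities` are hypothetical.

```python
import math


def gaussian_mass(lo, hi, centre, sigma):
    """Integral of a normalized Gaussian density over the interval [lo, hi]."""
    s = sigma * math.sqrt(2.0)
    return 0.5 * (math.erf((hi - centre) / s) - math.erf((lo - centre) / s))


def beam_probabilities(coeffs, centres, sigma, half_width):
    """Return (|C_m|^2, W_m): the normalized weights and the integral of
    W(Z) = sum_n |C_n|^2 |phi_n(Z)|^2 over each beam window V_m."""
    norm = sum(abs(c) ** 2 for c in coeffs)
    weights = [abs(c) ** 2 / norm for c in coeffs]
    probs = []
    for zm in centres:
        lo, hi = zm - half_width, zm + half_width
        probs.append(sum(w * gaussian_mass(lo, hi, zn, sigma)
                         for w, zn in zip(weights, centres)))
    return weights, probs
```

When the beam centres are many widths apart, each integral over $V_m$ picks out only its own term and the two returned lists agree to high accuracy; moving the centres closer together makes the cross terms visible, which is the overlap that condition (21) is meant to exclude.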

The detectors record the occurrence of the atom in a particular beam. 
The detector may be a “cold” plate placed in the path of the beam (plates 
$\Phi_1, \Phi_2, \ldots$ in Figure 9). An atom falling on such a plate gives up its energy 
and is absorbed. 

It would be simple to show that the flux at the plate in each beam is 
proportional to the probability $W_m$. Hence the concentration of the 
deposited atoms on the plate may be measured to define $|C_m|^2$. This 
instrument is “classical” and macroscopic not only because the fields 
are macroscopic but also because of the classical character of the motion 
of the centre of gravity of the atom itself. In this example the atom is 
used as part of the macroscopic devices intended to elucidate the internal 
state of the atom. An atom is a relatively heavy system and does not 
produce marked diffraction effects; it moves along a nearly classical 
path and sticks to the cold plate at a well-defined point, so that the 
sticking of the atom in this example is itself a macroscopic effect, although 
not on a very large scale. 

In this example we again find the previous characteristic features of the 
measurement process: macroscopic fields and macroscopic motion of the 
centre of gravity disrupt the interference between the different states 
$\psi_n(x)$ forming the initial quantum ensemble $\Psi_M(X, Y, Z, x)$. Sticking of 
a heavy atom at a definite spot on the plate serves as the basis for 
“contracting” the initial wave function $\Psi_M$ of (22) to one of the particular 
states $\psi_m(x)$: $\Psi_M \to \psi_m(x)$. 
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THE THEORY OF MEASUREMENT 


The essence of a measuring instrument I may be expressed as 
I = A + D 


in which A denotes an analyser and D a detector. 

Previously we have considered mainly the theory of the analyser and 
left the description of the detector on one side. In particular, we have 
described in detail the action of an inhomogeneous magnetic field on the 
motion of an atom, which in the example given (Chapter XII) served as 
the classical mechanism which performs the spectral resolution of the 
initial ensemble. We have not dealt in detail with the theory of the 
diffraction grating that resolves the ensemble with respect to p, simply 
because that theory is well known and is largely the theory of Bragg 
reflection. For the same reason we did not go into the mathematical 
theory of the light polarizer in the example on the resolution of light 
waves into spatially separate polarized beams. That theory would, strictly 
speaking, be even more out of place in this book, for it relates to the 
theory of electromagnetic fields rather than to quantum mechanics proper. 

On the other hand, we avoided mathematical description of the 
operation of the detector for quite opposite reasons: the detailed mode of 
action of the detector is usually very complex and is best considered as a 
separate topic; this topic forms the subject of this section. The complexity 
is probably the reason why the theory of the detector is commonly not 
presented in textbooks on quantum mechanics, but this major omission is 
the source of many misunderstandings and incorrect conceptions. 

The mathematical theory of measurement is as follows. The entire 
macroscopic setting $\mathfrak{M}$ may be put as the sum of parts: 

$$\mathfrak{M} = M + A + D,$$

in which M is the part of the macroscopic setting that governs the state of 
the initial ensemble. To avoid a complicated argument, let us assume 
that the initial ensemble arises in the pure state described by the wave 
function $\Psi_M(x)$ (here x denotes the set of dynamic variables describing 
the microsystem $\mu$). A denotes the part of that setting which acts as the 
analyser, which performs the spectral decomposition of $\Psi_M(x)$ into the 
pure states of the relevant dynamic variable, here taken as L. 

The last is assumed to have a discrete spectrum of proper values 
$L_1, L_2, \ldots, L_n$ together with the corresponding pure states $\psi_1(x), \psi_2(x), \ldots, 
\psi_n(x)$ for the microsystem. We then have the part of the macrosetting 
containing the detector D, which essentially changes its state in response 
to a microparticle. This change is a macroscopic phenomenon. Let Q be 
the dynamic variables of the detector, which are required to describe the 
states and changes in the detector. Since the detector is a macroscopic 
device (in particular, it may be assigned a temperature $\theta$) it is best to 
describe it via a density matrix $\rho_D(Q, Q')$ rather than via a wave function 
$\psi(Q)$. In this connection it is also more convenient to describe the 
microsystem $\mu$ that interacts with the detector via a density matrix $\rho_\mu(x, x')$ 
rather than via a wave function $\Psi_M(x)$. The interaction means that the 
two matrices must be combined into one common density matrix, which 
now is a function of time: 

$$\rho_{D+\mu} = \rho_{D+\mu}(Q, x; Q', x', t). \tag{1}$$

This matrix obeys the equation of motion 

$$\frac{\partial \rho_{D+\mu}}{\partial t} + [\mathcal{H}, \rho_{D+\mu}] = 0, \tag{2}$$

$$\mathcal{H} = \mathcal{H}_D(Q) + \mathcal{H}_\mu(x) + W_{D\mu}(Q, x),$$

in which $\mathcal{H}_\mu(x)$ is the hamiltonian of $\mu$, $\mathcal{H}_D(Q)$ is the same for D, and 
$W_{D\mu}(Q, x)$ is the operator that describes the interaction. This equation 
is easily written down but is very difficult to solve in cases of interest to us. 
It is sufficient to recall that a detector is usually a very complex macro- 
scopic device, such as the sensitized grains in a photographic emulsion, 
the supercooled vapour in a cloud chamber, the electron avalanche in a 
Geiger counter, etc. All the same, we shall consider later some schematic 
examples of the operation of conventional detectors, while at present we 
shall deal with the operation formally on the basis of the common matrix 
$\rho_{D+\mu}(Q, x; Q', x')$, which without restriction of any kind may be put as 

$$\rho_{D+\mu}(Q, x; Q', x') = \sum_{m,n} \psi_m^*(x)\, \rho_{mn}(Q, Q')\, \psi_n(x'), \tag{3}$$

in which the $\psi_n(x)$ are the proper functions of the measured quantity L. 
This matrix is of course non-diagonal with respect to L in the general 
case. 

A measuring instrument that carries out its function well will give rise 
for a time t of adequate length to a situation such that, if the Q lie in the 
range $Q'_n < Q < Q''_n$ (which may be the position of the needle of a macroscopic 
instrument at some definite point on its scale), all elements of the 
matrix for $t \to \infty$ obey 

$$\rho_{mn}(Q, Q', t) = 0, \tag{3'}$$

apart from 

$$\rho_{nn}(Q, Q', t) \neq 0, \tag{3''}$$

if 

$$Q'_n < Q,\ Q' < Q''_n. \tag{3'''}$$

This condition means firstly that the interference between the individual 
states $\psi_n(x)$ of the microsystem is destroyed, and secondly that the value 
$L = L_n$ of the dynamic variable under measurement corresponds uniquely 
to the state Q of the detector. This theory is the most general scheme for a 
measurement in quantum mechanics, as it includes not only the microsystem 
but also both parts of the macroscopic instrument (analyser and 
detector). 

Conditions (3'), (3''), and (3''') may be considered as technical conditions 
specifying a good macroscopic instrument. 
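
A toy model may help to picture what these conditions demand. The sketch below is an assumption-laden illustration, not the general theory: it supposes a pure product state in which each value $L_n$ is tied to a Gaussian pointer packet centred on its own scale mark, so that the matrix elements factor as $C_m^* C_n\, g_m(Q)\, g_n(Q')$. The names `pointer` and `rho_element` are hypothetical.

```python
import math


def pointer(q, centre, width):
    """Normalized Gaussian pointer amplitude centred on one scale mark."""
    norm = math.sqrt(width * math.sqrt(math.pi))
    return math.exp(-(q - centre) ** 2 / (2.0 * width ** 2)) / norm


def rho_element(m, n, q, q_prime, coeffs, centres, width):
    """Toy matrix element rho_mn(Q, Q') = C_m^* C_n g_m(Q) g_n(Q')."""
    return (coeffs[m].conjugate() * coeffs[n]
            * pointer(q, centres[m], width)
            * pointer(q_prime, centres[n], width))
```

With the pointer read inside the window of mark n, only the element with both indices equal to n is appreciable, while every element involving another index is suppressed by the tail of a distant packet. In this toy picture that is the content of (3') and (3''); a realistic detector needs the density-matrix treatment of the text.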


A. DETERMINATION OF THE INTERNAL STATE OF AN ATOM 


Here we consider an example of such a determination by the method of 
deflection in an external field (Chapter XII), with allowance for the 
operation of the detector. 

We must now supplement our system of atom + electron with a detector, 
which in this case is a “cold” plate that adsorbs the atom at its surface. 
Let q denote a set of dynamic variables describing this plate, e.g. the 
normal coordinates describing the vibration of its atoms. The plate has a 
temperature $\theta$. The complete density matrix describing atom + electron + 
detector may be put as 

$$\rho(Q, x, q; Q', x', q', \theta, t) = \sum_{m,n} \Psi_m^*(Q, t)\, \psi_m^*(x)\, \rho_{mn}(q, q', \theta, t)\, \Psi_n(Q', t)\, \psi_n(x'), \tag{4}$$

in which $\Psi_n(Q, t)$ and $\psi_n(x)$ have the meanings of Chapter XII; $\psi_n(x)$ 
describes the state of the electrons in the atom, while $\Psi_n(Q, t)$ describes 
the state of motion of the atom as a whole; the matrix element $\rho_{mn}(q, q', 
\theta, t)$ describes the state of the detector D. 

The external field separates spatially the beams described by the 
functions $\Psi_n(Q)$, in a way such that the matrix (4) over a sufficiently 
long period t goes over to 

$$\rho(Q, x, q; Q', x', q', \theta, t)_{t \to \infty} = \sum_n |\Psi_n(Q)|^2\, |\psi_n(x)|^2\, \rho_{nn}(q, q', \theta, t). \tag{5}$$

In this matrix $\Psi_n(Q)$ has the form 


v,(Q) = exp| - o 2] n(2—2), (6) 


2a? a 


and describes the state of an atom adsorbed on the plate near the point 
$Q = Q_n$. This atom vibrates about that position with an amplitude 
$a = (\hbar/2M\omega_0)^{1/2}$, in which M is the mass of the atom and $\omega_0$ is the vibration 
frequency. The matrix element $\rho_{nn}(q, q', \theta, t)$ describes the excited state 
of the lattice of atoms in the plate that arises from the transfer of the 
energy of the adsorbed atom to the atoms of the plate. The fact that the 
function describing the n-th beam has become $\Psi_n(Q)$ means that the 
“pointer” of the instrument (here the heavy atom) has taken up a definite 
position on the scale of the instrument (on the “cold” plate). The general 
theory developed in the previous section shows that the spatial separation 
of the beams implies that all terms in the matrix of (5) are zero, except the 
n-th, for a given coordinate of the atom adsorbed around $Q \approx Q_n \pm a$. 

Adequate performance in our detector requires that each atom falling 
on the plate from the beam should be adsorbed there. A necessary condition 
for such adsorption is that the temperature $\theta$ of the plate should be low. 

Under these conditions the plate+atom system is unstable in the sense 
that the plate is heated by the energy received from the incident atom 
(the entropy of the system is increased). The reverse process is improbable, 
since it would be accompanied by a reduction in the entropy. 
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B. MOMENTUM OF A MICROPARTICLE DETERMINED BY 
INTERACTION WITH A MACROSCOPIC BODY 


Consider now a simple but somewhat formal scheme¹ for the determination 
of the momentum k of a microparticle $\mu$ from the latter’s 
interaction with a macroscopic body. 

It is clear that this body must be in a state of unstable (or nearly 
unstable) equilibrium, as otherwise the microparticle would be unable to 
“disturb” it. As our body we take a sphere of mass M whose centre of 
gravity has a coordinate Q and whose potential energy U(Q) is of the 
form shown in Figure 10. The sphere thus lies at the peak in U, and its 
relative stability there is due to a minor relative minimum in U(Q). 
Acquirement of a small amount of energy $\Delta E > U_0 - E_0$ will cause the 
sphere to roll down the flank. U(Q) thus takes the form of a high volcano 
with a small crater (Figure 10). This sphere is also a detector, determining 
the direction of the momentum of the microparticle, which may knock the 
sphere to right or left. 

We have greatly simplified the problem by assigning to the sphere only 
the one degree of freedom Q, so it will be more convenient to use wave 
functions (rather than density matrices) to deal with the problem. 


Fig. 10. Representation of the potential energy U(Q) of a heavy sphere M, $E_0$ being 
the initial state. The wave functions $\Psi_0(Q)$, $\varphi^{\pm}(Q)$ and $\psi_0(\xi)$ are also shown. 

We assume that at the start (t = 0) the microparticle $\mu$ is described by 

$$\psi_0(\xi) = A^+ e^{ik\xi} + A^- e^{-ik\xi}, \tag{7}$$

in which $\xi$ is the coordinate of the microparticle (momentum k). This 
means that we assume a pure state, but with an undefined momentum $\pm k$. 
The task of our instrument is to determine the sign of the momentum 
(direction of motion of the particle). 

The wave function of the sphere M at t = 0 is 

$$\Psi_0(Q) = \frac{1}{\sqrt{a\sqrt{\pi}}} \exp\left(-\frac{Q^2}{2a^2}\right), \tag{8}$$

in which $a = (\hbar/M\omega_0)^{1/2}$, $\omega_0$ being the vibrational frequency of the sphere 
within the crater. 
The wave function of the whole system ($\mu$ + M) at t = 0 is then 

$$\Phi(Q, \xi, 0) = \varphi_0(Q, \xi) = \Psi_0(Q)\, \psi_0(\xi). \tag{9}$$

The hamiltonian for this system is 

$$\mathcal{H}(Q, \xi) = -\frac{\hbar^2}{2M} \frac{\partial^2}{\partial Q^2} + U(Q) - \frac{\hbar^2}{2\mu} \frac{\partial^2}{\partial \xi^2} + W(Q, \xi), \tag{10}$$

in which $W(Q, \xi)$ is the energy of the interaction between M and $\mu$, the 
latter being considered as free and the former having the potential energy 
U(Q). For simplicity we assume $W(Q, \xi)$ to have the form 

$$W(Q, \xi) = g\, \delta(Q - \xi) \tag{11}$$

and that the wave function $\Phi(Q, \xi, t)$ at any time t obeys the equation 

$$i\hbar \frac{\partial \Phi}{\partial t} = \mathcal{H}(Q, \xi)\, \Phi. \tag{12}$$

This function is sought in the form 

$$\Phi(Q, \xi, t) = \varphi_0(Q, \xi) + \varphi^+(Q, \xi, t) + \varphi^-(Q, \xi, t). \tag{13}$$

On the assumption that the coupling constant g is small, we find $\varphi^+$ and 
$\varphi^-$ in the first approximation of perturbation theory as 

$$\varphi^+(Q, \xi, t) = \int u_{p'k'}(t)\, \Psi_{p'}(Q)\, e^{ik'\xi}\, e^{-i(\omega_{p'} + \omega_{k'})t}\, dp'\, dk' = e^{-i(\omega_0 + \omega_k)t} \int u_{p'k'}(t)\, \Psi_{p'}(Q)\, e^{ik'\xi}\, e^{-i\Omega t}\, dp'\, dk', \tag{14}$$

$$\Omega = \omega_{p'} + \omega_{k'} - \omega_0 - \omega_k = \frac{z}{t}. \tag{15}$$

Here $\hbar\omega_0 = E_0$ is the energy of the sphere in its initial state, $E_{p'} = \hbar\omega_{p'}$ the 
energy in the final state, $p'$ being the momentum of the sphere after 
transition to the excited state; $\hbar\omega_k = \varepsilon_k$ and $\hbar\omega_{k'} = \varepsilon_{k'}$ are the energies of 
the particle before and after interaction with the sphere. 

The form of $\varphi^-(Q, \xi, t)$ is analogous. 

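Next, we integrate (12) after substituting from (13) and using (14) to get 

$$u_{p'k'}(t) = \frac{1}{\hbar}\, \frac{e^{i\Omega t} - 1}{\Omega}\, U_{p'k',0k}, \tag{16}$$

in which the matrix element on the right is 

$$U_{p'k',0k} = gA^+ \int \Psi_{p'}^*(Q)\, e^{-ik'\xi}\, \delta(Q - \xi)\, \Psi_0(Q)\, e^{ik\xi}\, dQ\, d\xi. \tag{17}$$

The function $\Psi_{p'}(Q)$ for the sphere in the excited state may be put in the 
semiclassical approximation as 

$$\Psi_{p'}(Q) \approx N_{p'} \exp\left[\frac{i}{\hbar} S_{p'}(Q)\right], \tag{18}$$

in which $N_{p'}$ is a normalization factor and $S_{p'}(Q)$ is the action function, 
approximately equal to $p'Q$. For this reason the integral in (17) equals 
the Fourier transform $\tilde\Psi_0(\kappa)$ of $\Psi_0(Q)$, with $\kappa = p' + k' - k$. Hence 

$$\varphi^+(Q, \xi, t) = e^{-i(\omega_0 + \omega_k)t}\, \frac{gA^+}{\hbar} \int N_{p'}^*\, \tilde\Psi_0(\kappa)\, \frac{1 - e^{-i\Omega t}}{\Omega}\, \Psi_{p'}(Q)\, e^{ik'\xi}\, dp'\, dk'. \tag{19}$$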


Now let P be the momentum of the sphere such as to correspond to 
conservation of energy during the interaction; then (15) shows that this 
will be so for z/t = 0 and 

$$\omega_P = \frac{E_P}{\hbar} = \omega_0 + \omega_k - \omega_{k'}, \qquad E_P = \frac{P^2}{2M} + \text{const},$$

so that 

$$\frac{p'^2}{2M} - \frac{P^2}{2M} = \frac{z}{t}. \tag{20}$$

Hence we find that 

$$p' - P = \frac{z}{vt}, \qquad dp' = \frac{dz}{vt},$$

in which v = P/M is the velocity of the sphere. Further, for $z \approx 0$ 

$$\frac{k'^2}{2\mu} = \frac{k^2}{2\mu} - \frac{P^2}{2M} \tag{21}$$

or 

$$(k' - k)(k' + k) = -\frac{\mu}{M} P^2. \tag{21'}$$

If $\Psi_0(Q)$ is not too sharp (the amplitude is not very small, which will 
occur if the crater is not very deep), the Fourier transform $\tilde\Psi_0(p' + k' - k)$ 
of this will differ appreciably from zero only for 

$$p' + k' - k = 0. \tag{22}$$

Then (21) and (22) with $M \to \infty$ give us that 

$$k' = -k, \tag{23}$$

$$P = 2k, \tag{24}$$

which is as would be expected for a light particle colliding with a heavy 
but weakly bound sphere; the light microparticle is reflected elastically, 
the energy transfer being small (vanishingly small for $M \to \infty$). 
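
The limiting values of (23) and (24) can be checked against the exact solution for finite M. The sketch below assumes momentum balance $P + k' - k = 0$ together with the energy balance $k'^2/2\mu = k^2/2\mu - P^2/2M$ (the small constant $U_0 - E_0$ is neglected, as in the text); the function name `collision` is hypothetical.

```python
def collision(k, mu, mass):
    """Nontrivial root of P + k' - k = 0 and k'^2/(2 mu) = k^2/(2 mu) - P^2/(2 M).

    Returns (k_after, p_sphere): the particle momentum after the collision
    and the momentum given to the sphere.
    """
    ratio = mu / mass
    p_sphere = 2.0 * k / (1.0 + ratio)
    k_after = k - p_sphere
    return k_after, p_sphere
```

For mu/M tending to zero the result approaches k' = -k and P = 2k, while for comparable masses the particle keeps less of its speed and the sphere takes a finite share of the energy.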
Now let 
Zz 
DAME eee ce 
v 
in such a way that 


dk’ = dq, =q—-P+k+— 
v 
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We insert the new integration variables, g and z, into (19) to obtain 
gA* t|N,I° 
$°(0.6)=9 exo] ilo. +o) | AE 


Poe ras oe ) (25) 


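in which 

$$J^+\left(\frac{Q - \xi}{vt}\right) = \int_{-\infty}^{+\infty} \left(1 - e^{-iz}\right) \exp\left[iz\, \frac{Q - \xi}{vt}\right] \frac{dz}{z}. \tag{26}$$

If we denote the discontinuous integral by 

$$J(\alpha) = \int_{-\infty}^{+\infty} \frac{e^{i\alpha z}}{z}\, dz = \begin{cases} +\pi i, & \alpha > 0, \\ -\pi i, & \alpha < 0, \end{cases} \tag{27}$$

then 

$$J^+\left(\frac{Q - \xi}{vt}\right) = J\left(\frac{Q - \xi}{vt}\right) - J\left(\frac{Q - \xi - vt}{vt}\right). \tag{28}$$

We note that vt > 0 for $\varphi^+$, because then P = 2k > 0; (28) thus 
implies that $J^+ = 2\pi i$ for $vt > Q - \xi$ and $Q - \xi > 0$, otherwise $J^+ = 0$. 
Now $\varphi^+$ contains the factor $\Psi_0(\xi)$, so that only small values of $|\xi| \approx a$ are 
important, and the result implies that $\varphi^+(Q, \xi, t)$ differs from zero for 
$t \to \infty$ only in the region $0 < Q < +\infty$, i.e. to the right of the rim of the 
crater. This corresponds to the receipt of positive momentum P = 2k 
from the microparticle. 
We calculate $\varphi^-(Q, \xi, t)$ in precisely the same way; here P < 0, v < 0, 
and $J^+$ is replaced by 

$$J^-\left(\frac{Q - \xi}{vt}\right) = J\left(\frac{Q - \xi}{vt}\right) - J\left(\frac{Q - \xi - vt}{vt}\right),$$

which differs from zero for $vt < Q - \xi < 0$. In this case the sphere is ejected 
from the crater to the left. 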
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The density matrix for our case is then 


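$$\begin{aligned}
\rho(Q, \xi; Q', \xi', t) = \Phi^*(Q, \xi, t)\, \Phi(Q', \xi', t) ={}& \varphi_0^*(Q, \xi, t)\, \varphi_0(Q', \xi', t) \\
&+ \varphi_0^*(Q, \xi, t)\, \varphi^+(Q', \xi', t) + \varphi_0^*(Q, \xi, t)\, \varphi^-(Q', \xi', t) \\
&+ \varphi^{+*}(Q, \xi, t)\, \varphi_0(Q', \xi', t) + \varphi^{-*}(Q, \xi, t)\, \varphi_0(Q', \xi', t) \\
&+ \varphi^{+*}(Q, \xi, t)\, \varphi^-(Q', \xi', t) + \varphi^{-*}(Q, \xi, t)\, \varphi^+(Q', \xi', t) \\
&+ \varphi^{+*}(Q, \xi, t)\, \varphi^+(Q', \xi', t) + \varphi^{-*}(Q, \xi, t)\, \varphi^-(Q', \xi', t).
\end{aligned} \tag{29}$$

All terms in this matrix, except the last two, vanish for $t \to \infty$ and $|Q|, 
|Q'| \gg a$; that is, the terms containing $\varphi_0$ vanish as $\exp(-Q^2/a^2)$ or 
$\exp(-Q'^2/a^2)$ for $Q, Q' \to \pm\infty$, while the interference terms (which 
contain products of the type $\varphi^{+*}\varphi^-$) vanish for $t \to \infty$ on account of the 
properties of the functions $J^{\pm}([Q - \xi]/vt)$. Then for $t \to \infty$ and for 
$|Q|, |Q'| \gg a$ we have 

$$\rho(Q, \xi; Q', \xi', t) = \varphi^{+*}(Q, \xi, t)\, \varphi^+(Q', \xi', t) + \varphi^{-*}(Q, \xi, t)\, \varphi^-(Q', \xi', t), \qquad t \to \infty,\ |Q|, |Q'| \gg a. \tag{30}$$

The macroscopic instrument thus destroys the interference between the 
states of the microparticle: 

$$A^+ e^{ik\xi} \quad \text{and} \quad A^- e^{-ik\xi}.$$

Also, for $Q, Q' \to +\infty$ we have 

$$\rho(Q, \xi; Q', \xi', t) \to \varphi^{+*}(Q, \xi, t)\, \varphi^+(Q', \xi', t) \tag{31}$$

and for $Q, Q' \to -\infty$ 

$$\rho(Q, \xi; Q', \xi', t) \to \varphi^{-*}(Q, \xi, t)\, \varphi^-(Q', \xi', t). \tag{31'}$$

The cases of (31) and (31') correspond to finding the sphere to the right 
or left of the crater respectively. 
We have 

$$\rho(Q, \xi; Q', \xi', t) \to 0$$

for $Q \to +\infty$ and $Q' \to -\infty$ or $Q \to -\infty$ and $Q' \to +\infty$ (this is the case of 
interference between results of observations on the right and left). This is 
as would be expected from a “good” instrument; its “pointer” should 
take up one of the possible definite positions. In our case the heavy sphere 
acts as the pointer. 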
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C. THERMODYNAMICALLY UNSTABLE DETECTOR 


The following is a schematic example of such a detector for a microparticle. 
The latter may be an atom with one valency electron, which thus 
gives the atom as a whole a magnetic moment equal to that of the electron, 
$M_B \boldsymbol{\sigma}$, in which $M_B$ is the Bohr magneton and $\boldsymbol{\sigma}(\sigma_x, \sigma_y, \sigma_z)$ is the Pauli spin 
matrix. The wave function $\Psi$ for the atom may be put as 

$$\Psi(Q, x) = \Psi_1(Q)\, \psi_1(x) + \Psi_2(Q)\, \psi_2(x), \tag{32}$$

in which $\Psi_1(Q)$ and $\Psi_2(Q)$ are functions describing the motion of the 
atom as a whole; $\psi_1(x)$ and $\psi_2(x)$ are functions describing the internal 
state of the atom and correspond to the two possible orientations of the 
atomic magnetic moment. 

The magnetic field H is assumed to act along the OZ axis, so $\psi_1$ 
corresponds to the moment lying parallel to the OZ axis, and $\psi_2$ to the 
moment antiparallel to the axis. This field is also inhomogeneous, and the 
beams differing in orientation of the magnetic moment are already 
spatially separate, so 

$$\Psi_1(Q)\, \Psi_2(Q) = 0, \tag{33}$$

as described in Chapter XII. Thus we will suppose that the first purpose 
of the measuring instrument [destruction of the interference between 
states $\psi_1(x)$ and $\psi_2(x)$, corresponding to different orientations of the 
spin of the valency electron] has already been fulfilled; in other words, 
transmission through the inhomogeneous field leaves us merely to place 
in each beam the appropriate detector, which will record the entry of 
particles corresponding to that beam, i.e. the state of a particle (“state” 
in the sense of spin orientation). 

Such a detector might be a large set of oscillators s = 1, 2, ..., N with 
$N \to \infty$; to avoid complexity of notation, we can assume these to be 
two-dimensional (oscillating in the xy-plane). We further assume that 
oscillations in x in practice do not interact with those in y. 

This means that the x oscillations may have a temperature $\theta$ distinct 
from that of the y oscillations; thus we assume that in the initial state of 
the detector (before the interaction with $\mu$) the x modes are coupled to a 
Gibbs thermostat of temperature $\theta$, so these x modes have that temperature 
also at t = 0, whereas the y modes are assumed to be at absolute 
zero at t = 0. The detector is thus in a thermodynamically unstable state; 
the slightest coupling between the x and y modes will rapidly cause 
transfer of energy from the x modes to the y ones.² 

The temperature rise in the y modes is the macroscopic effect that 
gives proof of the state of the given individual microparticle (in this case 
an atom). 

The operation of such a detector may now be considered mathematically. 
The hamiltonian of the unperturbed system of oscillators is written 
as 

$$\mathcal{H}_0 = \sum_{s=1}^{N} \mathcal{H}_0(x_s, y_s) - E_0, \tag{34}$$

$$\mathcal{H}_0(x_s, y_s) = -\frac{\hbar^2}{2M} \left( \frac{\partial^2}{\partial x_s^2} + \frac{\partial^2}{\partial y_s^2} \right) + \frac{M\omega_0^2}{2} (x_s^2 + y_s^2), \tag{35}$$

in which $E_0 = \hbar\omega_0 N/2$ is the zero-point energy, M is the mass of the 
oscillators, and $\omega_0$ is their natural frequency. These oscillators interact 
with an incident beam microparticle, the interaction energy being 

$$W = \omega \sum_{s=1}^{N} \mathbf{M}_s \boldsymbol{\sigma} = -i\hbar\omega \sum_{s=1}^{N} \sigma_z \frac{\partial}{\partial \varphi_s}, \tag{36}$$

in which $\mathbf{M}_s$ is the mechanical (angular) momentum of the oscillator and $\boldsymbol{\sigma}$ is the 
spin matrix for the optical electron in the atom. We have assumed that 
the spins are oriented along the Z axis in the beam, so $\mathbf{M}_s \boldsymbol{\sigma}$ may be 
replaced by $M_{sz}\sigma_z$, in which 

$$M_{sz} = -i\hbar \left( x_s \frac{\partial}{\partial y_s} - y_s \frac{\partial}{\partial x_s} \right) = -i\hbar \frac{\partial}{\partial \varphi_s},$$

and $\sigma_z$ may be replaced simply by 

$$\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix};$$

obviously, we take $\sigma_z$ as +1 for one detector and −1 for the other. We 
note that 

$$x_s = r_s \cos\varphi_s, \qquad y_s = r_s \sin\varphi_s, \tag{37}$$

$$r_s = +\sqrt{x_s^2 + y_s^2}. \tag{37'}$$

Detector D is described via a density matrix $\rho$ taken in the (x, y) 
representation. By x we understand all the x coordinates of the oscillators 
$(x_1, x_2, \ldots, x_s, \ldots, x_N)$ and by y the corresponding coordinates $(y_1, y_2, \ldots, 
y_s, \ldots, y_N)$; then $\rho$ may be put as 

$$\rho = \rho(x, y; x', y', t).$$

The matrix $\rho$ satisfies the following equation (see Chapter VI): 

$$\frac{\partial \rho}{\partial t} + [\mathcal{H}_0 + W, \rho] = 0. \tag{38}$$


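It is more convenient if $\rho$ is replaced by the matrix 

$$\tilde\rho = \exp\left( \frac{i\mathcal{H}_0 t}{\hbar} \right) \rho\, \exp\left( -\frac{i\mathcal{H}_0 t}{\hbar} \right). \tag{39}$$

We note that $[W, \mathcal{H}_0] = 0$, since W contains only the angular momenta 
$M_{sz}$, which commute with the isotropic oscillator hamiltonian (35); replacing 
$\rho$ by its expression in terms of $\tilde\rho$ we obtain 

$$\frac{\partial \tilde\rho}{\partial t} + [W, \tilde\rho] = 0. \tag{40}$$

In the substitution here for the operator W from (36) we must bear in 
mind the rules for multiplying matrices with continuous rows and columns. 
For this purpose the operator W must be put in matrix form. For instance, 
the operator $-i\hbar\, \partial/\partial q$ should be replaced by 

$$P_{q'q''} = -i\hbar \frac{\partial}{\partial q'}\, \delta(q' - q''). \tag{41}$$

The multiplication $P\rho$ denotes the following: 

$$(P\rho)_{q'q''} = \int P_{q'q'''}\, \rho(q''', q'')\, dq''' = -i\hbar \frac{\partial}{\partial q'}\, \rho(q', q''), \tag{41'}$$

and so on. These simple rules lead the substitution for W in (40) to give 
a simple equation in partial derivatives: 

$$\frac{\partial \tilde\rho}{\partial t} - \omega \sum_{s=1}^{N} \left( \frac{\partial \tilde\rho}{\partial \varphi_s} + \frac{\partial \tilde\rho}{\partial \varphi'_s} \right) = 0. \tag{42}$$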
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The solution of this is elementary; the general integral is 


$$\tilde\rho = F(\omega t + \varphi_1, \omega t + \varphi_2, \ldots, \omega t + \varphi_s, \ldots, \omega t + \varphi_N, r_1, r_2, \ldots, r_s, \ldots, r_N;\ \omega t + \varphi'_1, \omega t + \varphi'_2, \ldots, \omega t + \varphi'_s, \ldots, \omega t + \varphi'_N, r'_1, r'_2, \ldots, r'_N), \tag{43}$$

in which $r_1, r_2, \ldots, r_N$ and $r'_1, r'_2, \ldots, r'_N$ appear as parameters. 

Consider now the initial data for this matrix. To avoid cumbrous 
formulae for the factors, we use as a unit of length the quantity 
$l = (\hbar/2M\omega_0)^{1/2}$, while $\theta$ is replaced by $\beta = \hbar\omega_0/\theta$. All the quantities become 
dimensionless in these units. For t = 0 the matrix is 

$$\rho(x, y; x', y', 0) = \rho_\theta(x, x')\, \rho_0(y, y').$$

Making the appropriate assumption that the y modes are at absolute 
zero, we have 

$$\rho_0(y, y') = C_0 \exp\left[ -\tfrac{1}{2} \sum_s (y_s^2 + y_s'^2) \right], \tag{44}$$

in which $C_0$ is a constant of normalization, while $\exp(-y_s^2/2)$ is the wave 
function describing the zero-point oscillation of oscillator s along the Oy 
axis. 

The $\rho_\theta(x, x')$ matrix is much more troublesome, because the x modes 
are at a temperature $\theta$. Here the state is mixed, and the weights of the 
individual states $\psi_n(x)$ (energies $E_n$) are 

$$e^{-E_n/\theta} = e^{-\beta E_n},$$

so $\rho_\theta(x, x')$ describes an ensemble in equilibrium with a Gibbs thermostat 
at temperature $\theta$ and is put in the form 

$$\rho_\theta(x, x') = e^{F(\beta)} \sum_n e^{-\beta E_n}\, \psi_n^*(x)\, \psi_n(x') = e^{F(\beta)} Z_\theta(x, x'), \tag{45}$$

in which 

$$Z_\theta(x, x') = \sum_n e^{-\beta E_n}\, \psi_n^*(x)\, \psi_n(x'). \tag{46}$$


Here the sum is first taken over all states n having the energy $E_n$ and then 
over all states differing in $E_n$. Direct calculation of such a sum is extremely 
difficult even in the case of oscillators, so we use an indirect approach 
based on the fact that 

$$\mathcal{H}(x)\, \psi_n(x) = E_n\, \psi_n(x)$$

and hence 

$$f(\mathcal{H})\, \psi_n(x) = f(E_n)\, \psi_n(x), \tag{47}$$

in which $\mathcal{H}(x)$ is the Hamilton operator for the system and $\psi_n(x)$ is its 
proper function. We can then put (46) in the form 

$$Z_\theta(x, x') = \sum_n e^{-\beta \mathcal{H}(x)}\, \psi_n^*(x)\, \psi_n(x') \tag{46'}$$

and by differentiating with respect to $\beta$, we can show that the sum 
$Z_\theta(x, x')$ satisfies the differential equation 

$$\frac{\partial Z_\theta}{\partial \beta} + \mathcal{H} Z_\theta = 0. \tag{48}$$

We must replace $\mathcal{H}(x)$ by the unperturbed operator for the x modes: 

$$\mathcal{H}_0(x) = \sum_{s=1}^{N} \left( -\frac{1}{2} \frac{\partial^2}{\partial x_s^2} + \frac{1}{2} x_s^2 \right) - \frac{1}{2} N, \tag{49}$$

which is taken from (34) and (35), with allowance for the new units of 
length for x. 

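The additivity of the hamiltonian of (49) allows the variables in (48) 
to be separated, so (48) may be solved explicitly for one x alone. In this case 
we have 

$$\frac{\partial Z_\theta(x, x')}{\partial \beta} - \frac{1}{2} \frac{\partial^2 Z_\theta(x, x')}{\partial x^2} + \left( \frac{1}{2} x^2 - \frac{1}{2} \right) Z_\theta(x, x') = 0. \tag{50}$$

The solution is sought in the form 

$$Z_\theta(x, x') = \exp(a + b x^2 + c\, x x' + b\, x'^2) \tag{51}$$

subject to the boundary condition 

$$Z_\theta(x, x') \sim \exp\left( -\frac{(x - x')^2}{2\beta} \right), \qquad \theta \to \infty,\ \beta \to 0, \tag{52}$$

which corresponds to evaporation of the oscillators for $\theta \to \infty$; (52) takes 
the form of a sum for particles of an ideal gas. 
Substitution of (51) into (50) gives 

$$\frac{\partial a}{\partial \beta} = b + \frac{1}{2}, \qquad \frac{\partial b}{\partial \beta} = 2b^2 - \frac{1}{2}, \tag{53}$$

$$\frac{\partial c}{\partial \beta} = 2bc, \qquad \frac{\partial b}{\partial \beta} = \frac{1}{2} c^2. \tag{53'}$$

This system is compatible and has the solution 

$$b = -\frac{1}{2}\, \frac{e^{2\beta} + 1}{e^{2\beta} - 1} \to -\frac{1}{2\beta} \quad (\beta \to 0); \tag{54}$$

$$a = \int \left( b + \tfrac{1}{2} \right) d\beta = \frac{\beta}{2} + \int b\, d\beta \to -\tfrac{1}{2} \log \beta + \text{const} \quad (\beta \to 0), \tag{54'}$$

$$c^2 = 4b^2 - 1, \qquad c = \sqrt{4b^2 - 1}, \tag{54''}$$

into which we introduce also the parameter 

$$\gamma = -\frac{c}{2b} = \sqrt{1 - \frac{1}{4b^2}}. \tag{54'''}$$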
These results for a, b, and c show that (52) is satisfied. 
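
The coefficients can be checked numerically. The sketch below assumes the closed form $b = -\tfrac{1}{2}(e^{2\beta}+1)/(e^{2\beta}-1)$ with $c = \sqrt{4b^2 - 1}$ and $\gamma = -c/2b$, and tests them against the relations obtained by inserting the exponential ansatz (51) into (50), namely $\partial b/\partial\beta = 2b^2 - \tfrac{1}{2}$ and $\partial c/\partial\beta = 2bc$. The helper names are hypothetical, and the derivatives are taken by central differences.

```python
import math


def coeff_b(beta):
    """Quadratic coefficient b(beta) of the ansatz (51)."""
    return -0.5 * (math.exp(2.0 * beta) + 1.0) / (math.exp(2.0 * beta) - 1.0)


def coeff_c(beta):
    """Cross coefficient c(beta), positive root of c^2 = 4 b^2 - 1."""
    return math.sqrt(4.0 * coeff_b(beta) ** 2 - 1.0)


def gamma(beta):
    """Parameter gamma = -c / (2 b)."""
    return -coeff_c(beta) / (2.0 * coeff_b(beta))


def derivative(f, beta, h=1e-6):
    """Central-difference estimate of df/dbeta."""
    return (f(beta + h) - f(beta - h)) / (2.0 * h)
```

In the high-temperature limit the product $2\beta\, b$ tends to −1, which reproduces the Gaussian of the boundary condition (52), and $\gamma$ reduces to $1/\cosh\beta$.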
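From (51) and (54)-(54''') we may put $\rho_\theta(x, x')$ in the explicit form 

$$\rho_\theta(x, x') = C_\theta \exp\left[ b \sum_{s=1}^{N} (x_s^2 - 2\gamma\, x_s x'_s + x_s'^2) \right]. \tag{55}$$

Here $C_\theta = \exp[F(\beta)]$ is a normalization factor, in which $F(\beta)$ is the free 
energy of an oscillator. Using (44), we obtain the complete density matrix 
$\rho$ for the state of the detector at t = 0: 

$$\rho(x, y; x', y', 0) = C_0 C_\theta \exp\left[ b \sum_{s=1}^{N} (x_s^2 - 2\gamma\, x_s x'_s + x_s'^2) - \frac{1}{2} \sum_{s=1}^{N} (y_s^2 + y_s'^2) \right]. \tag{56}$$

To find $\rho$ at time t we must use (37) and (37'), substituting $\varphi_s + \omega t$ and 
$\varphi'_s + \omega t$ for $\varphi_s$ and $\varphi'_s$ everywhere in (56), in accordance with (43). The result is 

$$\rho(x, y; x', y', t) = C_0 C_\theta \exp(\Lambda + A \cos 2\omega t + B \sin 2\omega t), \tag{57}$$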
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in which 
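$$\Lambda = \frac{b}{2} \sum_{s=1}^{N} \left[ r_s^2 + r_s'^2 - 2\gamma\, r_s r'_s \cos(\varphi_s - \varphi'_s) \right] - \frac{1}{4} \sum_{s=1}^{N} (r_s^2 + r_s'^2), \tag{58}$$

$$A = \frac{b}{2} \sum_{s=1}^{N} \left[ r_s^2 \cos 2\varphi_s + r_s'^2 \cos 2\varphi'_s - 2\gamma\, r_s r'_s \cos(\varphi_s + \varphi'_s) \right] + \frac{1}{4} \sum_{s=1}^{N} (r_s^2 \cos 2\varphi_s + r_s'^2 \cos 2\varphi'_s), \tag{59}$$

$$B = -\frac{b}{2} \sum_{s=1}^{N} \left[ r_s^2 \sin 2\varphi_s + r_s'^2 \sin 2\varphi'_s - 2\gamma\, r_s r'_s \sin(\varphi_s + \varphi'_s) \right] - \frac{1}{4} \sum_{s=1}^{N} (r_s^2 \sin 2\varphi_s + r_s'^2 \sin 2\varphi'_s). \tag{60}$$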
This rather cumbrous result should be averaged over a period 34a, if we 
assume that the frequency characterizing the atom-detector bond is fairly 
high. 
The observed result is then defined by the matrix 


+n/2 
B(x, y; x’, y’, t) = CoC, exp \4 : | [(A cos 2@t + B sin 2at)] ash 
™ 2, 
(61) 
The latter integral gives rise to a Bessel function:

$$\frac{1}{\pi}\int_{-\pi/2}^{+\pi/2} \exp\left[A\cos 2\omega t + B\sin 2\omega t\right] d(\omega t) = I_0(R), \tag{62}$$

in which

$$R = \sqrt{A^{2} + B^{2}}.$$

So the matrix

$$\rho(x, y; x', y', t)$$


averaged with respect to time, is 


B(x, ys x’, Y’, 1) = CoCy e**1o(R), (63) 

$$I_0(R) = 1 + \tfrac{1}{4}R^{2} + \dots, \qquad |R| \ll 1, \tag{64}$$

$$I_0(R) = \frac{e^{R}}{\sqrt{2\pi R}} + \dots, \qquad |R| \gg 1. \tag{64'}$$
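The time average in (62) and the two limiting forms (64) and (64') can be checked numerically. The following sketch uses only the standard library; it assumes that the integration variable in (62) is the phase $u = \omega t$ running over one period $[-\pi/2, +\pi/2]$, and the function names are illustrative, not taken from the text.

```python
import math


def bessel_i0(r, terms=60):
    """Modified Bessel function I_0(r) from its power series sum_k (r/2)^(2k) / (k!)^2."""
    total, term = 0.0, 1.0
    for k in range(terms):
        total += term
        term *= (r / 2.0) ** 2 / ((k + 1) ** 2)
    return total


def time_average(a, b, steps=2000):
    """Midpoint-rule value of (1/pi) * integral of exp(a cos 2u + b sin 2u) over [-pi/2, pi/2]."""
    h = math.pi / steps
    acc = 0.0
    for j in range(steps):
        u = -math.pi / 2.0 + (j + 0.5) * h
        acc += math.exp(a * math.cos(2.0 * u) + b * math.sin(2.0 * u))
    return acc * h / math.pi


if __name__ == "__main__":
    a, b = 0.3, 0.4
    r = math.hypot(a, b)
    print(time_average(a, b), bessel_i0(r))                         # the two sides of (62)
    print(bessel_i0(0.1), 1.0 + 0.1 ** 2 / 4.0)                     # small-R form (64)
    print(bessel_i0(30.0), math.exp(30.0) / math.sqrt(2.0 * math.pi * 30.0))  # large-R form (64')
```

The average depends on A and B only through $R = \sqrt{A^{2} + B^{2}}$, which is why a single Bessel function appears in the averaged matrix.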


Hence for small R we have 
p(x, ys x, y's t) = CoC e* . (65) 


and for large R 
A+R 


e 
es 2nR 

Remembering that $b \approx -1/(2\beta) = -\theta/2$, the exponential factor
arising from (58) indicates that energy is transferred between the x and y
modes, the temperature falling from $\theta$ to $\theta/2$.

The result for large R also shows that the energy distribution between 
the x and y modes alters, but this is not so obvious as for small R. 

Thus we see that a microparticle entering a thermodynamically un- 
stable detector causes there a complete change in the energy distribution, 
i.e. a macroscopic effect. 

These examples show that a macroscopic measuring instrument must
be an unstable system (more precisely, almost unstable); only then can a
microparticle alter its state, and this change of state is a macroscopic
event. A microparticle is powerless to act on an instrument represented
by a stable macroscopic system; it cannot “displace the pointer” from
the zero position.


p(x, y3 x’, y’, t) = CoC (65’) 
CHAPTER XIV


THE WAVE FUNCTION AS THE 
OBSERVER’S NOTEBOOK 


This representation (as the book in which the observer records the 
results of his measurements on microsystems) may not be correct, as we 
may see from the answer usually given in textbooks. Let us suppose that 
from some previous measurements it is known that the wave function
$\Psi_M(x)$ represents the state of the microsystems in the ensemble.

The observer enters in his book the simple sign 


$$\Psi_M(x). \tag{1}$$


This in principle allows the observer to predict the probability of the 
results from any possible measurement on the microsystems $\mu$.

We assume that the observer intended to measure a quantity L having
(for simplicity) only two proper values $L_1$ and $L_2$, the corresponding
proper functions being $\psi_1(x)$ and $\psi_2(x)$; these might for example be the
spin states of the particle. Then $\Psi_M(x)$ may be represented as the
superposition of the particular states $\psi_1(x)$ and $\psi_2(x)$:


$$\Psi_M(x) = C_1\psi_1(x) + C_2\psi_2(x), \tag{2}$$


in which $C_1$ and $C_2$ are coefficients defining the relative participation of
the states with $L = L_1$ and $L = L_2$. We assume that the measurement is
made and indicates that $L = L_1$; the observer then assigns the particle that
has passed the check point (the measuring instrument) to a new ensemble
characterized by the new wave function $\psi_1(x)$. The observer takes his
notebook and strikes out $\Psi_M(x)$ as out of date and unsuitable for further
predictions of results and fresh measurements on the particle that has
undergone a measurement. He replaces $\Psi_M(x)$ by


$$\psi_1(x). \tag{3}$$

The transition from $\Psi_M(x)$ to $\psi_1(x)$ is

$$\Psi_M(x) = C_1\psi_1(x) + C_2\psi_2(x) \to \psi_1(x) \tag{4}$$

and is a process of “contraction” of a wave packet. From this viewpoint
the contraction is a direct consequence of the change in the observer’s 
information. The wave function itself is simply a routine record of his 
information on the state of the ensemble of microsystems. There is nothing 
wrong in this very common view of the wave function and of the con- 
traction process, and it is convenient as a formulation against which it is 
difficult to raise objection. In using the words “observer”, “measurement”,
“information”, and so on we merely pay respect to the physicists’ common
jargon, which is in no way a lucid tongue for the discussion of major
philosophical and methodological topics in physics. The entire theory of
measurement thereby acquires a dubious taint of subjectivism, which
becomes quite insupportable if we have to answer the question whether
quantum mechanics is applicable to the description of physical phenomena
occurring in the absence of an observer.

The observer is not an entirely obligatory being in this world, for 
quantum laws would scarcely be altered by one iota if this restless observer 
were removed from the scene. 

In one of his papers on quantum mechanics Schrödinger gave an
example of superposition of states which many readers found unnerving.
He considered a microsystem having two states $\psi_1$ and $\psi_2$; the first
causes a Geiger counter to fire, while the second leaves it undisturbed.
The firing of the counter actuates an amplifier to break a tube of prussic
acid in a chamber containing a cat.¹,²,³

Thus we have that the observer, looking into his notebook in order to
predict the result of a future measurement, finds among the possible
results of observation the fact of a possible interference between the
states of life and death of a cat. In fact, (4) implies that


$$|\Psi_M|^{2} = |C_1\psi_1|^{2} + |C_2\psi_2|^{2} + 2\,\mathrm{Re}\left(C_1^{*}C_2\psi_1^{*}\psi_2\right) \tag{5}$$


and the last term indicates this astonishing possibility. 
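The decomposition of the squared modulus into two direct terms and one cross term is a purely algebraic identity for complex numbers. The short sketch below makes the interference term explicit for sample values; the function name and the numbers are illustrative assumptions, not taken from the text.

```python
def born_density(c1, psi1, c2, psi2):
    """Return (direct, cross) so that |c1*psi1 + c2*psi2|^2 == direct + cross."""
    direct = abs(c1 * psi1) ** 2 + abs(c2 * psi2) ** 2
    # Interference term: 2 Re(c1* c2 psi1* psi2).
    cross = 2.0 * (c1.conjugate() * c2 * psi1.conjugate() * psi2).real
    return direct, cross


if __name__ == "__main__":
    c1 = complex(0.6, 0.0)
    c2 = complex(0.0, 0.8)
    psi1 = complex(1.0, 0.5)   # sample values of the two proper functions at one point x
    psi2 = complex(-0.3, 0.9)
    direct, cross = born_density(c1, psi1, c2, psi2)
    print(direct + cross, abs(c1 * psi1 + c2 * psi2) ** 2)
```

If the cross term is dropped, what remains is the density of a statistical mixture of the two states, with no interference between them.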

After observation of the actual event ($\psi_1$ or $\psi_2$), medicine certifies
either the death of the unfortunate cat or that it is alive and well, while
the wave function is condensed in that report to $\psi_1$ or $\psi_2$.

It is not difficult to see that this rather horrifying example may be
made even more disturbing if we replace the cat by the observer together
with medical attendants; then in the case of $\psi_1$ there will be no one to
contract the wave function.
However, we must return to more realistic examples. Consider the
decay of a radioactive atom, $\psi_1$ being the state before decay and
$\psi_2$ that after decay. Theory shows that $C_1 \sim \exp(-\lambda t)$, in which t is time,
with $\tau = 1/\lambda$ the decay period;


$$|C_2|^{2} = 1 - |C_1|^{2},$$


so $C_2$ increases with time, whereas $C_1$ decreases. Now imagine that we are
speaking of the remote past, when no observer could have transmitted 
to us information on the actual fate of the radioactive atom, e.g. during 
the time of the dinosaurs. If the interval t separating us from that time
greatly exceeds $\tau$, we can say with high probability that the atom has
decayed; but it is far from being a matter of complete indifference to the 
atom’s surroundings at what time it actually did decay. 
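To give a feeling for "high probability" here, the sketch below evaluates the two weights for a given ratio $t/\tau$. It assumes that the survival probability $|C_1|^{2}$ falls off as $\exp(-t/\tau)$; if the exponential law is instead meant for the amplitude $C_1$, the exponent doubles. The function name is illustrative.

```python
import math


def decay_probabilities(t, tau):
    """Return (|C_1|^2, |C_2|^2), assuming |C_1|^2 = exp(-t / tau)."""
    survive = math.exp(-t / tau)
    return survive, 1.0 - survive


if __name__ == "__main__":
    for ratio in (0.0, 1.0, 5.0, 20.0):
        survive, decayed = decay_probabilities(ratio, 1.0)
        print(ratio, survive, decayed)
```

The weights fix only how likely it is that the decay has occurred by time t; they say nothing about the particular instant of decay, which is the point of the argument that follows.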

We may recall Ray Bradbury’s story of travellers into the prehistoric 
past who carelessly crushed a butterfly, this event then influencing the 
outcome of the presidential election in the USA in the year 2000. 

The decay of an atom may cause a chain of events whose detailed 
content may be very much dependent on the exact moment of decay. On 
the other hand, our modern observer has not yet had the chance to 
contract the wave function from the $\Psi$ containing the superposition of
two possibilities: the atom has decayed ($\psi_2$) or is still in its initial state
($\psi_1$).

If the observer takes the trouble to measure the state of the atom, he 
will most probably find that it has decayed and is in state $\psi_2$; but our
contemporary is very late in reaching his conclusion if $t \gg \tau$, because any
other observer would have reached the same conclusion before him. That 
is, we could assert in the language of quantum mechanics that the atom 
has decayed, quite without reference to the observer. The event may lead 
to consequences dependent on the precise instant of decay and therefore 
cannot be related to changes in the observer’s information. The observer 
plays no part in the events of which we speak and so must be excluded 
from the game. 

Imagine a series of observers, of whom one is our contemporary and 
the others are successively earlier; in this series there will be one who was 
the first to record the fact of decay. This moment will be of objective 
significance and should be reflected in the apparatus of quantum me- 
chanics without reference to the observer. 
There is now a very obvious answer to the several paradoxes raised
previously, which follows from the above analysis of the physical signifi- 
cance of a measurement as a macroscopic process triggered by a micro- 
system. Nevertheless, the importance of the observer problem leads me to 
consider this aspect in somewhat greater detail and to summarize the 
above survey of the essence of measurement. 

We denote by $\mathfrak{M}$ the entire macroscopic system in which the relevant
microevents occur, the system being represented as a sum


$$\mathfrak{M} = M + (A + D) + O, \tag{6}$$


in which M denotes the part of the system termed preparative and which
governs the initial state $\Psi_M$ of the ensemble of microparticles; A+D
denotes the part in which a microparticle produces a macroscopic event 
and hence gives evidence of its state, this part being nominally divided 
into an analyser A (which causes the macroscopic event to be dependent 
on the state of the microparticle) and a detector D (which changes its 
state in response to the microparticle). Finally we include the observer O 
to cover the case where he considers that he affects the behaviour of the 
quantum ensemble. Of course, it is usual for the experimentalist to 
minimize his influence on the phenomenon as one of the primary points 
in designing the experiment. 

It will be evident that we may generally delete the O (the honourable 
observer) from the above sum. 

The entire essence of the contraction of the wave function is expressed 
in the fact that the microparticle produces a macroscopic effect, and the 
latter is of entirely objective significance, being quite unrelated to any
information the observer may have on the occurrence of the event. 

The part A+D may be such, or may be arranged to be such, that it
serves as a measuring instrument under certain conditions, this being
done deliberately. Moreover, the experimentalist can modify the part
A+D, i.e. carry out his measurements in another way, but subject
to the restriction that he avoids thereby having any effect on the part
M, as otherwise he may alter the initial ensemble. The division of $\mathfrak{M}$
into parts is only approximate, so any change in A+D requires some
caution.

Since the observer is interested in various measurements, we may use 
the following representation for a set of possible measurements in terms 
of symbolic forms for the macroscopic setting:


$$\begin{aligned}
\mathfrak{M} &= M + (A + D) + O, \\
\mathfrak{M}' &= M + (A + D)' + O, \\
\mathfrak{M}'' &= M + (A + D)'' + O,
\end{aligned} \tag{7}$$


in which the primes in (A+D)', (A+D)'', etc., denote that different
measuring instruments are used, whereas the part M (which defines the initial
ensemble) is kept unchanged as far as possible. This simple example
serves to stress once again how the concept of quantum ensembles differs
from other approaches to quantum mechanics: to a given quantum
ensemble defined by a macrosetting M (which may be represented
mathematically by the wave function $\Psi_M$ or the density matrix $\rho_M$) there
belong infinitely many statistical sets of measurement results relating to
the various measuring instruments


$$(A + D),\ (A + D)',\ (A + D)'',\ \dots$$


and so on, among which there may be instruments that measure comple- 
mentary dynamic variables, i.e. ones that cannot in principle be measured 
by the same instrument, such as space-time and energy-momentum 
measurements. 

If we assume that the observer is justified in his decision to change part 
of the macrosetting, i.e. the measuring instrument A+ D, then it is clear 
that objectively (i.e. without dependence on the observer) the quantum 
statistical collective of microsystems is specified by the fact that it remains 
unchanged during a replacement of one instrument by another, i.e. by 
the wave function $\Psi_M$ or the density matrix $\rho_M$.
CHAPTER XV
IS QUANTUM MECHANICS A COMPLETE THEORY? 


This problem was raised by A. Einstein, who gave an example that he 
considered to imply a negative answer. This example, known as the 
Einstein-Rosen-Podolsky paradox¹, at one time engaged the attention of
all physicists concerned with the basic problems of quantum theory. 
Nowadays it appears more difficult to formulate the “paradox” than to 
explain it, but we shall nevertheless examine it. 

Suppose that the wave function of a system consisting of particles 1 and
2 has the form


$$\Psi(x_1 - x_2) = \int e^{ip(x_1 - x_2 + a)}\, dp, \tag{1}$$


in which $x_1$ and $x_2$ are the coordinates of the particles; for these, p (the
momentum) assumes the values $p_1 = p$ and $p_2 = -p$. This function may be
put in the x representation as


$$\Psi(x_1 - x_2) = \int \delta(x - x_1)\,\delta(x - x_2 + a)\, dx. \tag{2}$$
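The step from the momentum form (1) to the coordinate form (2) can be written out explicitly. The following is a sketch under stated assumptions: units with $\hbar = 1$, and normalization constants (the factor $2\pi$) not tracked, since the wave function of the example is not normalizable in any case.

```latex
\int_{-\infty}^{+\infty} e^{ip(x_1 - x_2 + a)}\,dp
  = 2\pi\,\delta(x_1 - x_2 + a)
  = 2\pi \int_{-\infty}^{+\infty} \delta(x - x_1)\,\delta(x - x_2 + a)\,dx .
```

The first equality is the Fourier representation of the delta function; the second follows on integrating over $x$, which sets $x = x_1$ in the second factor. Both forms therefore express the same strict correlation $x_2 = x_1 + a$ together with $p_1 + p_2 = 0$.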


We assume that we have measured the momentum of the first particle
and found that $p_1 = p'$; then (1) implies that $p_2 = -p'$. Hence, although
we have performed no action on the second particle, the packet of (1) has
been contracted to the function


$$\exp\left[ip'(x_1 - x_2 + a)\right],$$


and the coordinate $x_2$ of particle 2 has become completely undefined. The
same applies if the coordinate $x_1$ is measured. Let us suppose that we find
$x_1 = x'$. Then (2) shows that $x_2 = a + x'$, i.e. the packet of (2) has
contracted to $\delta(x' - x_2 + a)$. Here again we find that the measurement on the
first particle has altered the state of the second, although by measuring
coordinate $x_1$ we may (at least in principle) avoid any influence from the
instrument on the second particle. Hence measurements on one particle
alter the state of another, and that in such a manner as to alter the initial
wave function $\Psi(x_1 - x_2)$ in one of the following ways:


$$\Psi(x_1 - x_2) \to \begin{cases} e^{ip'(x_1 - x_2 + a)}, \\ \delta(x' - x_2 + a). \end{cases} \tag{3}$$


Thus we have established that: a) the state of particle 2 is altered no 
matter whether the measuring instrument has any influence on this 
particle, and b) the new states of particle 2 are mutually exclusive, since
they are either states with defined momentum $p_2 = -p'$ or states with
defined coordinate $x_2 = a + x'$. Einstein, Rosen, and Podolsky based their
approach on the then common view that a particle ‘in reality’ may have a 
coordinate and a momentum simultaneously, but that the disturbance 
due to the measuring instrument (unmeasurable disturbance in Bohr’s 
terms) does not allow one to establish the simultaneous values of x and p. 
Therefore it was assumed that the concept of coexistence of coordinate x 
and momentum p must be eliminated from the theory, as not corre- 
sponding to the capacities of the observer. 

The Einstein-Rosen-Podolsky example has shown that the pair of
canonically conjugate variables x and p cannot be measured together, in 
spite of the absence of any disturbance due to the instrument. The impossi- 
bility arises on account of the change in the state of particle 2, which 
arises without any action on it from the instrument. This lack of action 
means that there is no reason why x and p should not be measured 
simultaneously if this pair (x, p) occurs in a physical reality that quantum 
mechanics is, unfortunately, unable to describe in its language. Conse- 
quently, this would mean that quantum mechanics is not complete, which 
was the conclusion reached by the authors of the paradox. 

This paradox has been discussed by N. Bohr² and L. Mandel’shtam³ on
the basis of different arguments, but ones that are not mutually in conflict. 

Bohr based his reply to Einstein on the principle of complementarity; 
he stressed its significance as a new physical principle that rules out states 
of a particle having simultaneously defined canonically conjugate 
variables, e.g. x and p. From this viewpoint, no measurement can lead 
to a state with simultaneously defined values of x and p, no matter whether 
the means of measurement is direct or indirect (in the paradox, the 
variables x and p for the second particle are measured indirectly via the 
measurement on the first particle), and this is so no matter whether the 
instrument produces a disturbance or not. Such a measurement would
contradict the principle of complementarity. 

There is nothing wrong with Bohr’s explanation, but Mandel’shtam’s 
interpretation of the paradox appears to reveal its essence more fully. 

Mandel’shtam’s interpretation is that the change in the state of the
second particle “without disturbance from the instrument” is actually
due to the correlation of the states of particles 1 and 2 in the initial
ensemble; the established fact $p_1 = p'$ implies that $p_2 = -p'$ as a direct
consequence of the correlation. Thus Mandel’shtam employs a statistical
interpretation of quantum mechanics to explain the paradox. The
significance of the correlation may be elucidated from a more general
example of two particles. Let the state of particles 1 and 2 be characterized
by their belonging to a quantum ensemble described at t=0 by the wave
function


$$\Psi(x_1, x_2, 0) = \psi_a(x_1)\,\psi_b(x_2), \tag{4}$$


in which $x_1$ and $x_2$ are the coordinates of the particles, a and b being the
values of certain other dynamic variables whose specification serves to 
define the initial ensemble. 

The interaction between the particles gives rise to a new state for
t>0, which is described by the wave function $\Psi(x_1, x_2, t)$. We expand
this as a spectrum with respect to the proper functions of operators A and
B that represent the dynamic variables a and b:


$$\Psi(x_1, x_2, t) = \iint C(a', b', t)\,\psi_{a'}(x_1)\,\psi_{b'}(x_2)\, da'\, db'. \tag{5}$$


This spectral expansion shows that a measurement a=a’ allows us to 
say nothing about the value of variable b and hence about the state of 
particle 2 after the measurement on particle 1, which gave the result 
a=a'. The situation is altered if the system consisting of particles 1 and 2 
is subject to some special laws; e.g. a and b may be subject to a law of 
conservation such that 


$$\frac{d}{dt}(A + B) = \left[\mathcal{H}, (A + B)\right] = 0, \tag{6}$$


in which $\mathcal{H}$ is the hamiltonian of system 1+2. Here the coefficients
C(a', b', t) in (5) take the form

$$C(a', b', t) = c(a', b', t)\,\delta(a' + b' - a - b), \tag{7}$$


which indicates that the sum a'+b' must be conserved, and that in this
case (5) is replaced by


$$\Psi(x_1, x_2, t) = \int c(a', a + b - a', t)\,\psi_{a'}(x_1)\,\psi_{a+b-a'}(x_2)\, da', \tag{8}$$


so that a measurement giving a=a’ automatically implies b’=a+b—a’. 
The correlation of the states of particles 1 and 2 in this example is a 
consequence of the laws of interaction of the particles, i.e. the law of 
conservation for the quantity described by the operator (A+B). 

Any measuring device that splits up the initial state into a spectrum in
terms of the states


$$\psi_{a'}(x_1)\,\psi_{b'}(x_2)$$

automatically gives a spectrum in terms of the states


$$\psi_{a'}(x_1)\,\psi_{a+b-a'}(x_2),$$


since there is no state with $b' \neq a + b - a'$ in the initial ensemble.

There is a strict relation between the values of a and b on account of the
dynamics of the interaction. In the paradox we have $a = p_1$ and $b = p_2$,
with $p_1 + p_2 = 0$ at t=0. The form of $\Psi(x_1, x_2)$ in Einstein’s example (1)
implies that momentum is conserved in the interaction of particles 1 and 2,
which provides the correlation between the states of particles 1 and 2.

The paradox thus arose from neglect of the statistical character of 
quantum mechanics and also of the statistical correlation between the 
states of particles 1 and 2 consequent on the dynamics of the interaction. 

Imagine a society whose laws of inheritance happen to be such that
fair-haired people have only blue eyes while dark-haired people have only
brown eyes. If we select an individual with blue eyes, we automatically
find fair hair in that individual; this in no way implies that the act of
selection colours the subject's hair. The observed correlation of hair and
eye colours is a result of the laws of inheritance in that population.
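The same point can be illustrated by a purely classical toy model, in the spirit of the inheritance analogy and not as a simulation of quantum mechanics. Pairs of values are generated subject to a conservation law $p_1 + p_2 = \text{const}$; selecting pairs by the value of the first member then fixes the value of the second, although nothing is ever done to the second member. All names and the Gaussian spread below are illustrative assumptions.

```python
import random


def sample_pairs(n, total=0.0, seed=0):
    """Generate n pairs (p1, p2) obeying the conservation law p1 + p2 = total."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        p1 = rng.gauss(0.0, 1.0)
        pairs.append((p1, total - p1))
    return pairs


def select_by_first(pairs, lo, hi):
    """Values of the second member for those pairs whose first member lies in [lo, hi]."""
    return [p2 for p1, p2 in pairs if lo <= p1 <= hi]


if __name__ == "__main__":
    pairs = sample_pairs(1000)
    selected = select_by_first(pairs, 0.0, 1.0)
    # Every selected second member lies in [-1, 0]: the correlation was built
    # in when the pairs were generated, not produced by the act of selection.
    print(len(selected), min(selected), max(selected))
```

Selection picks out a subensemble; the correlation it reveals was already present in the ensemble by virtue of the law under which the pairs were formed.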

Returning to the Einstein-Rosen-Podolsky paradox, we see that there
is no mysticism in the apparent influence of the instrument on the state
of the particle, which in fact is not affected; the apparent “influence” is
produced within the system by its own internal laws and is not a
consequence of “interference” from the instrument.

Here we may recall that we have repeatedly stressed: a microparticle 
always interferes with the state of the instrument, whereas the instrument 
sometimes does, and sometimes does not, interfere with the state of the 
particle. Hence we must accept as unsatisfactory the idea that quantum 
mechanics is a direct consequence of the capabilities of macroscopic 
instruments; quantum mechanics reflects objective laws that govern the 
microworld but speaks of them in the language of the macroscopic world. 
In particular, it is impossible to measure x and p simultaneously for a 
microparticle as a consequence of the basic laws of quantum mechanics, 
and any instrument is bound by these: the laws state that the pair x and p 
does not relate to “physical reality” and so cannot be observed under any
conditions. 

This means that we must reject the assumption in the paradox that the 
pair x and p actually exists and that only the incompleteness of quantum 
mechanics prevents us from bringing the pair together. 
CHAPTER XVI


LATENT PARAMETERS 


The abstract form of classical mechanics allows one to predict un- 
ambiguously the future of a system if the initial data are known. There is 
no need to stress how illusory is faith in such omnipotence of classical 
mechanics. However, this is precisely the basis on which classical statistical 
mechanics is often considered as a theory of the second kind, to which it 
is necessary to resort when a mechanical system becomes too complex 
for the equations of mechanics to be applied directly. 

Physicists and philosophers who take this view of statistical mechanics 
usually consider that, in principle, the strictly deterministic equations of 
classical mechanics could be applied; thus statistical mechanics is viewed 
simply as a device for avoiding an excessively complex computational 
problem. 

This naive viewpoint is difficult to reconcile with the fact that dynamics 
and statistics are inseparable in quantum mechanics, and the latter does 
not allow the most powerful of mathematicians (or computers) to abandon
the statistical description of the microworld even in principle. There is a
tendency to believe that quantum mechanics is merely a statistical method
for the description of microsystems that “in fact” are subject to dynamical
laws if they are described in terms of variables unknown in quantum
mechanics, namely the latent parameters $\lambda$. Specification of these parameters
would then eliminate statistics from quantum theory. It would be
more accurate to say that the statistics of a quantum ensemble would 
arise as a result of inexact specification of these remarkable parameters. 

This problem of the latent parameters $\lambda$ has been discussed repeatedly;
von Neumann appears to be the first to have directed attention to the 
problem. His outstanding monograph on the mathematical principles of 
quantum mechanics has had a great influence on the development of the 
idea of quantum ensembles in the USSR; von Neumann endeavoured to 
show that latent parameters are incompatible with the basic principles of 
quantum mechanics.¹ All the same, it appears that complete clarity was
not attained, and that the problem demands further analysis. 
The problem is extremely difficult to discuss in any very general
form. In one of his papers the present author has pointed out that it would
be groundless to assume that thermodynamics may contain within itself 
the basis for doubting the absolute significance of the second law of 
thermodynamics, for that law lies at the very basis of the theory. How- 
ever, progress in the kinetic theory showed that there are cases where the 
second law is explicitly violated without other conflicts with thermo- 
dynamics, as for example, in Brownian motion. The contradiction in this 
case was removed by adopting a wider view on the second law, taken in the 
kinetic theory of matter, namely the conception of the entropy S as a 
quantity defining the probability W of a state: $S = k \ln W$, in which k is
Boltzmann’s constant. 

Another possibility to examine is the introduction of latent parameters 
such as to give meaning to a proportion of the form 


$$\frac{x}{\text{quantum mechanics}} = \frac{\text{kinetic theory of matter}}{\text{thermodynamics}}, \tag{1}$$


in which x is some unknown (more complete) theory. 

It cannot be denied that the symbolic equation of (1), or some similar 
one, might be soluble, at least in this extremely general and purely 
methodological formulation of the problem. In what follows we shall 
attempt to select some narrower but more closely defined ways of dis- 
cussing these latent parameters, which we may divide into two possible 
classes: 

A) The parameters $\lambda$ are variables observable in principle but not
included among the purely dynamic variables considered in quantum
mechanics;

B) The parameters $\lambda$ are unobservable, even in principle.

These two classes must be discussed entirely separately. 


A. OBSERVABLE LATENT PARAMETERS 


Assume that the variables characterizing the macroscopic setting M and
the dynamic variables of a microparticle (L, F, Q, etc.) are accompanied
by additional variables $\lambda$ (latent parameters), the latter allowing one to
complete the definition of the state of a quantum system in such a way as
to reduce the random spread found with the usual dynamic and other
variables of the quantum ensemble $M + \mu$.

Von Neumann distinguished two types of statistical ensemble:
inhomogeneous and homogeneous. In the first type the mean value $\bar{A}$ of a
positive random quantity A may be expressed as a sum:


$$\bar{A} = \alpha_1\bar{A}_1 + \alpha_2\bar{A}_2, \tag{2}$$

in which

$$\alpha_1 + \alpha_2 = 1, \qquad \alpha_1, \alpha_2 > 0,$$


and $\bar{A}_1$ and $\bar{A}_2$ are means taken over subensembles via some particular
mode of selection of systems present in the ensemble, i.e. selection by
reference to the features that determine the probabilities $\alpha_1$ and $\alpha_2$.

For a homogeneous ensemble subject to any mode of selection we have 


$$\bar{A} = \bar{A}_1 = \bar{A}_2 = \dots \tag{3}$$


A mixed quantum ensemble is clearly inhomogeneous. A very simple 
example of a mixed ensemble is the one formed by particles derived from 
two incoherent sources (e.g. the $\alpha$-rays from two radioactive samples).
Any mean relating to these particles is to be represented as in (2), $\alpha_1$ and
$\alpha_2$ having the meaning of the probabilities that the particles come from
the two sources. 

This expansion is impossible, by virtue of the definition of a pure
ensemble, if the ensemble is described by a single wave function $\Psi_M$ and
hence is a pure ensemble. The definition of the latter implies that it consists
of an infinite set of identical macroscopic settings M, which produce the
conditions of existence for the microparticles $\mu$. Set M, of number $N \to \infty$,
may be imagined as divided into two subsets


$$N = N_1 + N_2,$$


each subset $N_1$ or $N_2$ having identical M and $\mu$. By identical is meant here
that the macroscopic parameters defining M are the same, while
macroscopically measurable parameters of the $\mu$ (such as mass m, charge e,
spin $\sigma$, etc.) are also identical. Then the $\Psi_M$ that defines the quantum
ensemble for subset $N_1$ or $N_2$ is precisely the same, by virtue of the
definition of the wave function itself, and so for any measurable quantity A
we have $\bar{A}_1 = \bar{A}_2$; hence the decomposition as in (2) is impossible.
This conclusion cannot conform with the possibility of introducing
latent parameters $\lambda$, since the latter are essentially such as would, if
measured, allow us to “define more closely” the quantum variables, e.g.
they would allow us to alter the variance of some dynamic variable L. For
instance, suppose that in the range $G_1(\lambda)$ of the latent parameters the
mean $\bar{L}$ takes the value $\bar{L}_1$, with variance $\overline{\Delta L_1^{2}}$, while in region $G_2(\lambda)$ the
corresponding values are $\bar{L}_2$ and $\overline{\Delta L_2^{2}}$. Then $\bar{L}$ and $\overline{\Delta L^{2}}$ for the entire
ensemble are


$$\bar{L} = \alpha_1\bar{L}_1 + \alpha_2\bar{L}_2, \tag{4}$$

$$\overline{\Delta L^{2}} = \alpha_1\overline{\Delta L_1^{2}} + \alpha_2\overline{\Delta L_2^{2}} + \alpha_1(\bar{L}_1 - \bar{L})^{2} + \alpha_2(\bar{L}_2 - \bar{L})^{2}, \tag{4'}$$

in which $\alpha_1$ and $\alpha_2$ are the relative weights of the regions $G_1(\lambda)$ and $G_2(\lambda)$,
with

$$\alpha_1 + \alpha_2 = 1, \qquad \alpha_1, \alpha_2 > 0.$$

In other words, a pure ensemble is inhomogeneous (in the parameters
$\lambda$), which is contrary to the definition.
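Formulas (4) and (4') are the usual rules for the mean and variance of a mixture of two subensembles. A minimal numerical sketch follows, in which the two lists of sample values stand in for measurements of L on systems whose latent parameters fall in the two regions; the data and function names are invented for illustration.

```python
def mean(xs):
    """Arithmetic mean of a list of numbers."""
    return sum(xs) / len(xs)


def variance(xs):
    """Mean square deviation from the mean (population variance)."""
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)


def decompose(sub1, sub2):
    """Mean and variance of the pooled ensemble computed via (4) and (4')."""
    n = len(sub1) + len(sub2)
    a1, a2 = len(sub1) / n, len(sub2) / n          # weights alpha_1, alpha_2
    m1, m2 = mean(sub1), mean(sub2)
    m = a1 * m1 + a2 * m2                          # formula (4)
    var = (a1 * variance(sub1) + a2 * variance(sub2)
           + a1 * (m1 - m) ** 2 + a2 * (m2 - m) ** 2)   # formula (4')
    return m, var


if __name__ == "__main__":
    sub1 = [1.0, 2.0, 3.0]
    sub2 = [10.0, 12.0]
    print(decompose(sub1, sub2))
    print(mean(sub1 + sub2), variance(sub1 + sub2))   # direct computation on the pooled data
```

The last two terms of (4') measure how far the subensemble means lie from the overall mean; they vanish only if $\bar{L}_1 = \bar{L}_2$, which is the homogeneous case (3).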


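If now we seek to define more closely these $\lambda$ by dividing the entire
region of possible values of $\lambda$ into smaller parts


$$G_1(\lambda),\ G_2(\lambda),\ \dots,\ G_k(\lambda),\ \dots,\ G_N(\lambda)$$

with weights


$$\alpha_1,\ \alpha_2,\ \dots,\ \alpha_k,\ \dots,\ \alpha_N, \qquad \sum_{k=1}^{N}\alpha_k = 1,$$


then we find for the limit of infinitely small regions $G_k(\lambda)$ that (4') is
replaced by


$$\overline{\Delta L^{2}} = \sum_{k}\alpha_k\left(L_k - \bar{L}\right)^{2}, \tag{5}$$

in which

$$\alpha_k = \frac{G_k(\lambda)}{\sum_{k} G_k(\lambda)}$$


and $L_k$ is the mean of L in $G_k(\lambda)$, this mean equalling the exact value of L
in region $G_k(\lambda)$ for $N \to \infty$.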

Formula (5) shows that the entire random spread in L is now due to the
uncertainty in the $\lambda$; the quantum dynamic variables should take
completely definite values if these parameters are determined accurately, i.e.
if $G_k(\lambda)$ is a narrow region.

This leads to further contradictions; there are no microsystems that can
be described in terms of space-time (Q-type) variables alone or in terms
of momentum-energy (P-type) ones alone. The principle of complementarity
indicates that the operators representing these quantities are not
commutative:

$$PQ - QP = i\hbar, \tag{6}$$


so $\overline{\Delta P^{2}} \neq 0$ if $\overline{\Delta Q^{2}} = 0$ in a quantum ensemble defined by the
macrosetting M; similarly, $\overline{\Delta Q^{2}} \neq 0$ if $\overline{\Delta P^{2}} = 0$. Hence if some dynamic variable
L of a quantum ensemble is determined exactly ($\overline{\Delta L^{2}} = 0$), we have either
$\overline{\Delta P^{2}} > 0$, or $\overline{\Delta Q^{2}} > 0$, or both simultaneously. Hence we would arrive at a
conflict with the principle of complementarity if we were to find a range
$G(\lambda)$ of the latent parameters that would give a subensemble with $\overline{\Delta Q^{2}} = 0$
and $\overline{\Delta P^{2}} = 0$.

The following argument² also confirms the difficulty of introducing
these latent parameters $\lambda$. Let the relevant dynamic variable be the
electron spin $S = (S_x, S_y, S_z)$. We have


$$[S_x, S_y] = i\hbar S_z. \tag{7}$$


The two possible values for the variables $S_x$, $S_y$ and $S_z$ are $\pm\hbar/2$. We
assume that it has been established that the projection of the spin on the
OZ axis is $S_z = +\hbar/2$. Let this value of the quantum variable be a
consequence of the fact that the $\lambda$ lie in the range $G_z^{+}(\lambda)$.

We now alter the total system $\mathfrak{M} = M + (A + D)_z$, replacing $(A + D)_z$ by
$(A + D)_x$, thereby sorting the particles $\mu$ in accordance with the variable
$S_x$, to get values of the latter equal to $+\hbar/2$ and $-\hbar/2$. Let $G_x^{\pm}(\lambda)$ be the
ranges in $\lambda$ giving $S_x = \pm\hbar/2$. Then $G_z^{+}(\lambda)$ splits up as


$$G_z^{+}(\lambda) = G_x^{+}(\lambda) + G_x^{-}(\lambda). \tag{8}$$


Suppose we have found that $S_x = +\hbar/2$. This means that the $\lambda$ lie in the
range $G_x^{+}(\lambda)$. We now again measure the projection of S on the OZ axis,
for which purpose we replace $(A + D)_x$ by the starting $(A + D)_z$. We then
obtain values of $+\hbar/2$ and $-\hbar/2$ for $S_z$, so that


$$G_x^{+}(\lambda) = G_z^{+}(\lambda) + G_z^{-}(\lambda). \tag{9}$$

Comparison of (8) with (9) gives us the contradiction

$$G_x^{-}(\lambda) + G_z^{-}(\lambda) = 0. \tag{10}$$
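The empirical input to this argument is that a beam already sorted to $S_z = +\hbar/2$, then sorted to $S_x = +\hbar/2$, again yields both values of $S_z$ on a third measurement. The sketch below reproduces these probabilities from the standard spin-1/2 eigenvectors; the vectors and the helper function are textbook conventions introduced here for illustration and are not taken from the text.

```python
import math

S = 1.0 / math.sqrt(2.0)

# Standard spin-1/2 eigenvectors in the S_z basis (components of a two-component spinor).
Z_PLUS, Z_MINUS = (1.0, 0.0), (0.0, 1.0)
X_PLUS, X_MINUS = (S, S), (S, -S)


def prob(outcome, state):
    """Probability |<outcome|state>|^2 of finding `outcome` in the given `state`."""
    amp = sum(o.conjugate() * s for o, s in zip(outcome, state))
    return abs(amp) ** 2


if __name__ == "__main__":
    # Beam prepared with S_z = +hbar/2, then sorted by S_x.
    print(prob(X_PLUS, Z_PLUS), prob(X_MINUS, Z_PLUS))
    # Sub-beam with S_x = +hbar/2, then sorted again by S_z.
    print(prob(Z_PLUS, X_PLUS), prob(Z_MINUS, X_PLUS))
    # Without the intermediate S_x sorting, S_z = -hbar/2 never occurs.
    print(prob(Z_MINUS, Z_PLUS))
```

The reappearance of $S_z = -\hbar/2$ after the $S_x$ sorting is what forces the splitting (9), and hence the contradiction (10), if the $\lambda$ are taken to be fixed properties of $M + \mu$.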


We may escape formally from this if we assume that the $\lambda$ relate to the
measuring instrument A+D, not to $M + \mu$; on replacing $(A + D)_z$ by
$(A + D)_x$ or vice versa, the $\lambda$ change their physical significance. The latent
parameters when $S_z$ is measured are not those when $S_x$ is measured, so
that $\lambda$ on the right and left in (8) are physically different. In other words,
the $\lambda$ vary with the measuring devices A+D, here $(A + D)_z$ and $(A + D)_x$.
The above possibility cannot be rejected a priori, but this view of the $\lambda$
does not refer them to the class of quantities that supplement the dynamic
variables of quantum mechanics, for they obviously belong to the class of
quantities considered in quantum mechanics, e.g. the variables that
characterize the measuring instrument, whose essence is such as to produce
a unique relation between the measured quantity L and the range $G(\lambda)$
of the $\lambda$ characteristic of the device. That is, if $\lambda \in G_L(\lambda)$, then the dynamic
variable equals L.

Thus we see that it is impossible to introduce latent parameters / that 
are in principle observable into section M+y of Mt without causing a 
conflict with the principle of complementarity. 
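The quantum-mechanical side of this argument can be checked numerically. The sketch below is an illustration only, not part of the original argument: it assumes units with ħ = 1, and the helper names `eigenstate` and `prob` are hypothetical. It confirms the commutation relation (7) and shows that an ensemble selected with $S_z = +\hbar/2$ and then with $S_x = +\hbar/2$ again yields both values of $S_z$ with equal probability, which is the experimental fact behind (9).

```python
import numpy as np

# Pauli matrices; the spin operators are S_i = (hbar/2) * sigma_i, with hbar = 1 here.
sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)
Sx, Sy, Sz = sx / 2, sy / 2, sz / 2

# Commutation relation (7): [S_x, S_y] = i * hbar * S_z.
commutator_ok = np.allclose(Sx @ Sy - Sy @ Sx, 1j * Sz)

def eigenstate(op, sign):
    """Normalized eigenvector of a Pauli matrix `op` with eigenvalue `sign` (+1 or -1)."""
    vals, vecs = np.linalg.eigh(op)
    return vecs[:, np.argmin(np.abs(vals - sign))]

def prob(state, op, sign):
    """Born probability of obtaining eigenvalue `sign` of `op` in `state`."""
    return abs(np.vdot(eigenstate(op, sign), state)) ** 2

z_up = eigenstate(sz, +1)   # ensemble selected with S_z = +1/2
x_up = eigenstate(sx, +1)   # sub-ensemble subsequently selected with S_x = +1/2

p_x_up_given_z_up = prob(z_up, sx, +1)      # sorting by S_x splits the S_z = +1/2 ensemble
p_z_down_given_x_up = prob(x_up, sz, -1)    # S_z = -1/2 reappears after the S_x selection
```

Both probabilities come out as 1/2, so the value $S_z = -\hbar/2$ reappears in an ensemble that was originally selected with $S_z = +\hbar/2$; this is what cannot be reconciled with fixed, nested ranges of observable λ.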


B. UNOBSERVABLE LATENT PARAMETERS 


We now assume that the λ are not observable even in principle. The term
“in principle” recalls the notorious “principle of observability in principle”
that some have considered as the philosophical basis of quantum
mechanics³, and it must therefore be stressed that the present “in
principle” implies a definite theory and its principles, which allow some
things and forbid others. In discussing “essentially unobservable latent
parameters” we are also obliged to have in mind the principles of some
definite theory, not a vast ocean of possibilities limited only by the power
of imagination. As our theoretical basis of discussion we take quantum
mechanics and assume that the term “essentially unobservable quantity”
defines a quantity whose observability is forbidden by the principles of
quantum mechanics. This conception of the λ does not conflict with the
principles of quantum mechanics.

Consider now the latent parameters λ of a pure quantum ensemble; if
these are unobservable, we cannot point to any method that would allow
us to isolate from the pure ensemble any subensemble whose means $\bar{L}$
and $\overline{L^2}$ differ from those characteristic of the ensemble as a whole. That is,
a decomposition as in (4) or (4') is not feasible, so there is no conflict
with the definition of pure ensemble, nor any with uncertainty relationships
of the form $\overline{\Delta P^2}\cdot\overline{\Delta Q^2} \ge \hbar^2/4$, given these assumptions about the λ.
Moreover, we shall show that such unobservable parameters λ occur in
current quantum mechanics. Henceforth we shall follow R. Feynman's
lines of thought⁴; Feynman has shown that the wave function describing
a pure ensemble may be represented as an integral over “ideal”
(unobservable) particle trajectories $x = x(t)$, $dx/dt = \dot{x}(t)$, in which the $x(t)$
are the coordinates of the particle at times $t$ and $\dot{x}(t)$ is the velocity.

Feynman's approach may be presented by recalling that the motion of a quantum
ensemble may be regarded as a series of unitary transformations. Let the
state of the microparticle be represented by the dynamic variable $a$, which
has a continuous spectrum. Let further $\psi(a_0, t_0)$ be the wave function
describing the state of the ensemble at time $t = t_0$, in which $a_0$ is the value
of $a$ at $t = t_0$ (from the definition of the wave function). Then the wave
function $\psi(a, t)$ for $t > t_0$ may be put in the form

$$\psi(a, t) = \int U(a, t; a_0, t_0)\,\psi(a_0, t_0)\,da_0, \tag{11}$$

in which $U(a, t; a_0, t_0)$ is the unitary matrix for the canonical transformation
from the variables $a_0$ to the variables $a$, which satisfies Schrödinger's equation:

$$i\hbar\frac{\partial U}{\partial t} = \hat{H}U, \tag{12}$$

in which $\hat{H}$ is the hamiltonian of the system. If for $U$ we take initial
conditions such that

$$U = \delta(a - a_0) \quad \text{for } t = t_0, \tag{13}$$

$$U = 0 \quad \text{for } t < t_0, \tag{13'}$$

then this $U$ coincides with the Green's function $G$ for (12) considered
earlier.
We take in (11) some time $t_1$ such that $t_0 < t_1 < t$ in place of the arbitrary
$t$; then (11) transforms $\psi(a_0, t_0)$ into $\psi(a_1, t_1)$. We can now take $\psi(a_1, t_1)$
as the initial function and use $U(a, t; a_1, t_1)$ to obtain once more the
function $\psi(a, t)$ for $t > t_1$; we obviously get

$$\psi(a, t) = \int da_1 \int da_0\; U(a, t; a_1, t_1)\,U(a_1, t_1; a_0, t_0)\,\psi(a_0, t_0). \tag{14}$$

Repeating this procedure for intermediate points in time

$$t_0 < t_1 < t_2 < \dots < t_N = t$$

and for the values of the dynamic variable

$$a_1, a_2, \dots, a_N = a$$

corresponding to these instants, we can represent the transformation
matrix $U(a, t; a_0, t_0)$ in the form

$$U(a, t; a_0, t_0) = \int\!\cdots\!\int da_1\,da_2 \cdots da_{N-1}\; U(a, t; a_{N-1}, t_{N-1})\,U(a_{N-1}, t_{N-1}; a_{N-2}, t_{N-2}) \cdots U(a_2, t_2; a_1, t_1)\,U(a_1, t_1; a_0, t_0), \tag{15}$$

i.e., as a series of “transitions”

$$a_0 \to a_1 \to a_2 \to \dots \to a_{N-1} \to a_N$$

for possible intermediate values of the variables $a_1, a_2, \dots, a_{N-1}$. This
formula is analogous to the classical formula⁵ for a Markoff chain:

$$P(a, t \mid a_0, t_0) = \int\!\cdots\!\int da_1\,da_2 \cdots da_{N-1}\; P(a, t; a_{N-1}, t_{N-1})\,P(a_{N-1}, t_{N-1}; a_{N-2}, t_{N-2}) \cdots P(a_2, t_2; a_1, t_1)\,P(a_1, t_1; a_0, t_0), \tag{16}$$

in which $P(a_k, t_k; a_{k-1}, t_{k-1})$ is the probability of a transition from state
$a_{k-1}$ to state $a_k$ in the period $t_k - t_{k-1}$. Formula (16) is the probability of
transition from state $a_0$ at time $t_0$ to state $a$ at time $t$ ($t > t_0$) as a result
of all possible transitions via intermediate values of $a$.

The quantum formula (15) has the probability $P$ replaced by the
probability amplitude $U$, which is related to $P$ by

$$P(a_k, t_k; a_{k-1}, t_{k-1}) = |U(a_k, t_k; a_{k-1}, t_{k-1})|^2. \tag{17}$$

This distinction means that the externally similar formulae (15) and (16)
are fundamentally different.
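The difference between (15) and (16) can be made concrete on the smallest possible example. The following sketch is an assumption-laden illustration: it replaces the continuous variable $a$ by a hypothetical two-state system with ħ = 1 and a hamiltonian chosen arbitrarily, so the integrals over intermediate values become matrix products. The amplitudes compose exactly as in (14) and (15), while the probabilities built from them by (17) do not obey the Markoff composition law (16).

```python
import numpy as np

# Hypothetical two-state system, hbar = 1, with an arbitrarily chosen hamiltonian.
H = np.array([[0.0, 1.0], [1.0, 0.0]])

def U(dt):
    """Transformation matrix exp(-i H dt), built from the eigen-decomposition of H."""
    vals, vecs = np.linalg.eigh(H)
    return vecs @ np.diag(np.exp(-1j * vals * dt)) @ vecs.conj().T

dt1, dt2 = 0.4, 0.7                     # two successive time intervals

U_total = U(dt1 + dt2)                  # direct transformation over the whole interval
U_chain = U(dt2) @ U(dt1)               # sum over the intermediate state, as in (14)

P_total = np.abs(U_total) ** 2          # transition probabilities from (17)
P_chain = (np.abs(U(dt2)) ** 2) @ (np.abs(U(dt1)) ** 2)   # Markoff-type product, as in (16)

amplitudes_compose = np.allclose(U_total, U_chain)
probabilities_compose = np.allclose(P_total, P_chain)
```

Here `amplitudes_compose` is true and `probabilities_compose` is false: the interference terms lost on taking $|U|^2$ at the intermediate instant are exactly what separates the quantum chain from a classical Markoff chain.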

In relation to (14) and (15), we assume that $a$ is the cartesian coordinate
$x$ of the microparticle; we put $t_{k+1} - t_k = \Delta t$ and assume that the
transformation function $U$ for small $\Delta t$ may be put as

$$U(x_{k+1}, t_{k+1}; x_k, t_k) = \frac{1}{A}\exp\left[\frac{i}{\hbar}S(x_{k+1}, x_k, \Delta t)\right], \tag{18}$$

in which $A$ is a normalization factor (independent of $x_k$ and $t_k$) and
$S(x_{k+1}, x_k, \Delta t)$ is the phase of $U$. This phase is exactly the classical action
function in the case of free motion, so we may assume that it remains correct
also in the presence of an external field described by a potential $V(x)$. We
suppose that this is true; then

$$S(x_{k+1}, x_k, \Delta t) = \left\{\frac{m}{2}\left(\frac{x_{k+1} - x_k}{\Delta t}\right)^2 - V(x_{k+1})\right\}\Delta t, \tag{19}$$

in which $m$ is the mass of the microparticle. In the limit of infinite
subdivision of the interval $(t_0, t)$ we get

$$S(x, \dot{x}, dt) = \left\{\tfrac{1}{2}m\dot{x}^2 - V(x)\right\}dt. \tag{20}$$

This is the element of classical action in time $dt$;

$$\dot{x}(t) = \lim_{\Delta t \to 0}\frac{x_{k+1} - x_k}{\Delta t}$$

is the velocity of the particle at time $t$, and

$$L(x, \dot{x}) = \tfrac{1}{2}m\dot{x}^2 - V(x)$$

is the classical Lagrange function. We shall show that the $\psi(x, t)$ given
by (11) subject to (18) and (19) satisfies the Schrödinger equation

$$i\hbar\frac{\partial\psi(x, t)}{\partial t} = \left\{-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + V(x)\right\}\psi(x, t). \tag{21}$$


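The proof* may be given by considering the transformation from
$\psi(x, t)$ to $\psi(x, t + \Delta t)$:

$$\psi(x, t + \Delta t) = \frac{1}{A}\int \exp\left\{\frac{i}{\hbar}\left[\frac{m}{2}\frac{(x - x_k)^2}{\Delta t} - V(x)\,\Delta t\right]\right\}\psi(x_k, t)\,dx_k. \tag{22}$$

We put

$$x - x_k = \xi,$$

so (22) is replaced by

$$\psi(x, t + \Delta t) = \frac{1}{A}\int \exp\left\{\frac{i}{\hbar}\left[\frac{m}{2}\frac{\xi^2}{\Delta t} - V(x)\,\Delta t\right]\right\}\psi(x - \xi, t)\,d\xi. \tag{22'}$$

We expand the

$$\psi(x - \xi, t) = \psi(x, t) - \xi\frac{\partial\psi}{\partial x} + \frac{\xi^2}{2}\frac{\partial^2\psi}{\partial x^2} - \dots$$

appearing here and perform a simple integration with respect to $\xi$ to get

$$\psi(x, t) + \frac{\partial\psi(x, t)}{\partial t}\Delta t + \dots = \exp\left[-\frac{i}{\hbar}V(x)\,\Delta t\right]\left(\frac{2\pi i\hbar\,\Delta t}{m}\right)^{1/2}\frac{1}{A}\left\{\psi(x, t) + \frac{i\hbar}{2m}\frac{\partial^2\psi(x, t)}{\partial x^2}\Delta t + O(\Delta t^2)\right\}. \tag{22''}$$

We take our normalization factor $A$ as equal to

$$\left(\frac{2\pi i\hbar\,\Delta t}{m}\right)^{1/2}$$

and perform the series expansion

$$e^{-(i/\hbar)V\Delta t} = 1 - \frac{i}{\hbar}V(x)\,\Delta t + \dots,$$

to demonstrate that $\psi(x, t)$ for $\Delta t \to 0$ actually does satisfy (21).

* This proof is due to Feynman⁴; Dirac⁶ appears to have been the first to point out
the relation of the phase $S$ to the classical action.

This allows us to put (15) as

$$U(x, t; x_0, t_0) = \int\!\cdots\!\int \exp\left\{\frac{i}{\hbar}\sum_{k=1}^{N}S(x_k, x_{k-1}, \Delta t)\right\}\frac{1}{A}\,\frac{dx_{N-1}}{A}\cdots\frac{dx_2}{A}\,\frac{dx_1}{A}, \tag{23}$$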
A A A A 
in which $S(x_k, x_{k-1}, \Delta t)$ is the classical action in time $\Delta t$ given by (19).
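The “simple integration” that leads from (22') to (22'') rests on Fresnel-type Gaussian integrals. They are written out below as a worked step; the abbreviation $\alpha = m/2\hbar\,\Delta t$ is introduced here only for convenience and is not the author's notation.

```latex
% Gaussian (Fresnel) integrals with \alpha = m / (2\hbar\,\Delta t):
\int_{-\infty}^{\infty} e^{i\alpha\xi^2}\,d\xi
   = \left(\frac{\pi i}{\alpha}\right)^{1/2}
   = \left(\frac{2\pi i\hbar\,\Delta t}{m}\right)^{1/2},
\qquad
\int_{-\infty}^{\infty} \xi\,e^{i\alpha\xi^2}\,d\xi = 0,
\qquad
\int_{-\infty}^{\infty} \xi^2 e^{i\alpha\xi^2}\,d\xi
   = \frac{i}{2\alpha}\left(\frac{\pi i}{\alpha}\right)^{1/2}
   = \frac{i\hbar\,\Delta t}{m}\left(\frac{2\pi i\hbar\,\Delta t}{m}\right)^{1/2}.

% With A = (2\pi i\hbar\,\Delta t/m)^{1/2}, the terms of first order in \Delta t give
\frac{\partial\psi}{\partial t}
   = -\frac{i}{\hbar}V(x)\,\psi + \frac{i\hbar}{2m}\frac{\partial^2\psi}{\partial x^2}
\quad\Longrightarrow\quad
i\hbar\frac{\partial\psi}{\partial t}
   = -\frac{\hbar^2}{2m}\frac{\partial^2\psi}{\partial x^2} + V(x)\,\psi .
```

The term linear in $\xi$ drops out by symmetry, the quadratic term supplies the factor $i\hbar\,\Delta t/2m$ in front of $\partial^2\psi/\partial x^2$, and the last line is the Schrödinger equation (21).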
If we replace $1/\hbar$ by


the integral of (23) has the meaning of a functional integral with respect
to a Wiener measure also for

$$N \to \infty, \qquad \Delta t = \frac{t - t_0}{N} \to 0.$$

The integral of (23) for $N \to \infty$ is the limit of this functional integral for
$\delta \to 0$ (see ref. 7); as $\Delta t \to 0$ the sum

$$\sum_{k=1}^{N} S(x_k, x_{k-1}, \Delta t)$$

is shown by (20) to go over to the integral of the classical action in the
period from $t_0$ to $t$:

$$\int_{t_0}^{t} L[x(\tau), \dot{x}(\tau)]\,d\tau = S(x, t, x_0, t_0),$$

the volume

$$\frac{1}{A}\,\frac{dx_{N-1}}{A}\cdots\frac{dx_1}{A}$$

for $N \to \infty$ being put symbolically as $dW\{x(\tau)\}$. Then (23) is replaced by

$$U(x, t; x_0, t_0) = \int dW\{x(\tau)\}\,\exp\left[\frac{i}{\hbar}\int_{t_0}^{t} L(x(\tau), \dot{x}(\tau))\,d\tau\right]. \tag{24}$$

In other words, the transformation function (probability amplitude)
$U(x, t; x_0, t_0)$ may be represented as a functional integral over all
trajectories leading from point $(x_0, t_0)$ to point $(x, t)$. Each trajectory

$$x(\tau),\ \dot{x}(\tau), \qquad t_0 \le \tau \le t,$$

gives a contribution to $U(x, t; x_0, t_0)$ equal to

$$dU = dW\{x(\tau)\}\,\exp\left[\frac{i}{\hbar}\int_{t_0}^{t} L(x, \dot{x})\,d\tau\right]. \tag{25}$$

The constancy of $A$ means that the amplitude $dW\{x(\tau)\}$ for each
trajectory is the same, whereas the phases

$$\frac{1}{\hbar}\int_{t_0}^{t} L(x, \dot{x})\,d\tau$$

are different. The factors

$$\exp\left\{\frac{i}{\hbar}S(x, t, t_0)\right\}, \qquad S(x, t, t_0) = \int_{t_0}^{t} L\,d\tau,$$

are oscillatory, so the largest contribution to (24) clearly arises from
terms of the form of (25) for which $S(x, t, t_0)$ is minimal, i.e. from those
trajectories for which

$$\delta\int_{t_0}^{t} L(x, \dot{x})\,d\tau = 0, \tag{26}$$

where $\delta$ denotes a variation of the trajectory $x(\tau)$. However, condition (26)
is also the condition that defines the classical trajectory of the particle,
so we may interpret Feynman's striking result, represented by formulae
(24), (25), and (26), as follows: a microparticle prefers to move along the
classical trajectory $x(t)$, which satisfies the principle of least action
expressed by (26), but it does not avoid the other possible trajectories
leading from $(x_0, t_0)$ to $(x, t)$ (Figure 11); (24) shows that all of these
possible trajectories make a definite contribution to the motion of the
quantum ensemble. However, the “ideal” trajectories $x(t)$ and velocities
$\dot{x}(t) = p(t)/m$ (in which $p$ is the momentum of the particle) appearing in
(24) and (25) are unobservable, because the basic principles of quantum
mechanics indicate that there is no quantum ensemble in which the
variables $x$ and $\dot{x} = p/m$ would be defined simultaneously.

Fig. 11. Classical trajectory S of a particle corresponding to a minimum in the action
function along S; paths S′, S″, etc. are ideal quantum ones, which also play a part in
the motion of a microparticle.

Thus the $x(t)$ and $\dot{x}(t)$ of (24) provide an example of latent parameters,
and unobservable ones at that; however, we have seen that there is no
conflict between the principles of quantum mechanics and the existence
of these latent parameters $\lambda = x(t), \dot{x}(t)$ that determine the “ideal”
path of a microparticle.
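Condition (26) can be illustrated with the discretized action of (19). The sketch below is a minimal example under stated assumptions: free motion ($V = 0$), units with $m = 1$, and a hypothetical one-parameter family of “ideal” paths obtained by adding $\varepsilon\sin(\pi\tau)$ to the straight classical path between fixed end points. The function name `action` and all numerical values are illustrative choices, not taken from the text.

```python
import numpy as np

def action(path, dt, m=1.0, V=lambda x: 0.0 * x):
    """Discretized action: sum over steps of {m/2 ((x_{k+1}-x_k)/dt)^2 - V(x_{k+1})} dt."""
    dx = np.diff(path)
    return float(np.sum((0.5 * m * (dx / dt) ** 2 - V(path[1:])) * dt))

N = 200
tau = np.linspace(0.0, 1.0, N + 1)      # time grid from t0 = 0 to t = 1
dt = tau[1] - tau[0]
x_start, x_end = 0.0, 1.0
classical = x_start + (x_end - x_start) * tau   # straight line: classical free motion

def perturbed(eps):
    """Path with the same end points, displaced from the classical one by eps*sin(pi*tau)."""
    return classical + eps * np.sin(np.pi * tau)

eps_values = np.linspace(-0.5, 0.5, 11)
actions = np.array([action(perturbed(e), dt) for e in eps_values])
best_eps = eps_values[np.argmin(actions)]   # perturbation amplitude with the smallest action
S_classical = action(classical, dt)
```

Within this family the action is smallest at zero perturbation, i.e. on the classical path; the neighbouring paths have larger action and enter (24) with rapidly varying phases.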
CHAPTER XVII


CAN A PARTICLE HAVE AN 
INDIVIDUAL HISTORY? 


Some people tend to be disturbed by the failure of current quantum 
mechanics to describe the fate of a single electron. This concern for the 
fate of a single microparticle, a single individual in the microworld, 
seems rather hypocritical for even in the macrocosm the fate of a given 
object cannot be predicted over a prolonged period. 

Perhaps this desire for prediction derives from the natural longing of 
man to know his future, which is sometimes so acute that some people 
even resort to fortune tellers. However, it is not difficult to appreciate 
the psychological state of a thinking man who would know precisely the 
entire course of his life; would it not lose for him all interest, as does a 
film whose plot is known in all its details? 

This comment has no direct application to the problems of quantum 
mechanics, but it may tend to assure those who continually repeat that the 
future cannot be predicted unambiguously, not even for an elementary 
particle. 

Consider now the history of an individual particle in the microcosm. 
First we must define more closely what we mean by this history, and what 
the history of a particle might signify. It is clear that it is correct to view 
the history of a particle as a sequence of its states in time, so we must 
start with the concept of the state of a particle. 

We know from the principles of quantum theory that we can describe
the state of a particle via space-time variables (denoted by $Q$) or via
momentum-energy ones (denoted by $P$); each set $Q$ or $P$ must form a
complete set of dynamic variables, i.e. they must be represented by
commutative operators

$$[\hat{Q}_i, \hat{Q}_k] = 0, \qquad i, k = 1, 2, \dots, f, \tag{1}$$

or, for the momentum set,

$$[\hat{P}_i, \hat{P}_k] = 0, \qquad i, k = 1, 2, \dots, f, \tag{1'}$$

and these must be mutually independent, their number equalling $f$, the
number of degrees of freedom of the microsystem. The state of a particle
is thus specified by the complete set of $Q$-type dynamic variables
$(Q) = Q_1, Q_2, \dots, Q_f$ or alternatively by $(P) = P_1, P_2, \dots, P_f$.

Prediction of the history of a particle would mean prediction of the
sequence $Q(t)$ or $P(t)$ for instants $t$:

$$t_1 < t_2 < \dots < t_k < \dots < t_N.$$

An interesting theorem due to Mandel'shtam¹ may be considered before
we turn to an analysis of the problems arising here.

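In the general uncertainty relation of (XI-3) we take the two operators to be
$\hat{L}$ and $\hat{H}$, in which $\hat{H}$ is the operator for the energy $E$; we then have

$$\overline{\Delta L^2}\cdot\overline{\Delta E^2} \ge \frac{\hbar^2}{4}\left(\overline{[\hat{H}, \hat{L}]}\right)^2, \tag{2}$$

in which $\overline{\Delta L^2}$ and $\overline{\Delta E^2}$ are the variances of $L$ and $E$, while

$$[\hat{H}, \hat{L}] = \frac{i}{\hbar}(\hat{H}\hat{L} - \hat{L}\hat{H}).$$

On the other hand,

$$\overline{[\hat{H}, \hat{L}]} = \frac{d\bar{L}}{dt}. \tag{3}$$

Hence,

$$\overline{\Delta L^2}\cdot\overline{\Delta E^2} \ge \frac{\hbar^2}{4}\left(\frac{d\bar{L}}{dt}\right)^2. \tag{4}$$

This formula implies that we have an “uncertainty relation” of the
following form if the mean $\bar{L}$ of $L$ in time $\Delta t$ alters by an amount
$\Delta L = (\overline{\Delta L^2})^{1/2}$:

$$\Delta E\,\Delta t \ge \frac{\hbar}{2}, \tag{5}$$

in which

$$\Delta E = \sqrt{\overline{\Delta E^2}}.$$

Consider now the description of the history of the particle in terms of
the $P$ variables. Let one of these variables be equal to $\hat{H}$; then
$\Delta E = 0$, and (5) shows that we require an infinite time $\Delta t$ for an
appreciable change to occur in the mean value of any of the dynamic variables
($\Delta t \ge \hbar/2\Delta E$).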

Thus there is no history at all if the microparticle is described via the 
momentum-energy variables. Of course, this conclusion is not unexpected 
because the description of the state via these variables would correspond 
in classical mechanics to description of the motion by means of integrals 
of motion, which are essentially independent of time. 
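Relation (4) and the absence of a history in a stationary state can be checked on a simple model. The sketch below is only an illustration under stated assumptions: a hypothetical two-level system with ħ = 1, $\hat{H} = \tfrac{1}{2}\hbar\omega\sigma_z$ and $\hat{L} = \sigma_x$, tested on randomly drawn states. The names `mean`, `variance` and `dL_dt` are illustrative helpers.

```python
import numpy as np

hbar = 1.0
omega = 2.0                                          # hypothetical level splitting
H = 0.5 * hbar * omega * np.array([[1, 0], [0, -1]], dtype=complex)
L = np.array([[0, 1], [1, 0]], dtype=complex)        # the observable L (here sigma_x)

def mean(op, psi):
    """Mean value of a Hermitian operator in the normalized state psi."""
    return np.vdot(psi, op @ psi).real

def variance(op, psi):
    """Variance of the operator in the state psi."""
    return mean(op @ op, psi) - mean(op, psi) ** 2

def dL_dt(psi):
    """Mean of the quantum Poisson bracket (i/hbar)(HL - LH), i.e. d<L>/dt as in (3)."""
    bracket = (1j / hbar) * (H @ L - L @ H)
    return mean(bracket, psi)

rng = np.random.default_rng(0)
gaps = []
for _ in range(1000):
    psi = rng.normal(size=2) + 1j * rng.normal(size=2)
    psi /= np.linalg.norm(psi)
    lhs = variance(L, psi) * variance(H, psi)         # left-hand side of (4)
    rhs = 0.25 * hbar ** 2 * dL_dt(psi) ** 2          # right-hand side of (4)
    gaps.append(lhs - rhs)
min_gap = min(gaps)                                   # non-negative up to rounding

stationary = np.array([1, 0], dtype=complex)          # eigenstate of H: Delta E = 0
stationary_rate = dL_dt(stationary)                   # the mean of L does not change
```

For every sampled state the left-hand side of (4) is at least as large as the right-hand side, and in an eigenstate of $\hat{H}$ both the energy variance and $d\bar{L}/dt$ vanish, which is the “no history” situation described above.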

An entirely different situation arises if the history of a particle is
described via the complete set of $Q$-type variables, which do not commute
with the hamiltonian $\hat{H}$, by virtue of the principle of complementarity;
hence generally speaking the means of $dQ/dt$ differ from zero. Then (4)
implies that $\overline{\Delta E^2} > 0$ and:

$$\overline{\Delta E^2} \ge \frac{\hbar^2}{4}\left(\frac{d\bar{Q}}{dt}\right)^2\frac{1}{\overline{\Delta Q^2}}, \tag{6}$$

so that states of the ensemble are not stationary if these $Q$ variables are
specified with finite error ($\overline{\Delta Q^2} \ne 0$); all the probabilities, means, variances,
and so on for all of the quantities will vary with time. The ensemble has a
history, and hence a microparticle has one too.

The prediction of the history of a microparticle will consist of a
prediction of the probability of some definite trajectory for the
microparticle:

$$(Q_0, t_0),\ (Q_1, t_1),\ \dots,\ (Q_k, t_k),\ \dots,\ (Q_N, t). \tag{7}$$

From (XVI-16) we have² that the probability of such a trajectory is

$$dW(Q_0, t_0; Q_1, t_1; \dots; Q_k, t_k; \dots; Q_N, t) = P(Q_N, t; Q_{N-1}, t_{N-1})\,P(Q_{N-1}, t_{N-1}; Q_{N-2}, t_{N-2}) \cdots P(Q_k, t_k; Q_{k-1}, t_{k-1}) \cdots P(Q_1, t_1; Q_0, t_0)\,dQ_1 \cdots dQ_{N-1}, \tag{8}$$

in which the individual factors are given by (XVI-17). For the very simple
case of free motion we have from (XVI-19) that

$$P(Q_k, t_k; Q_{k-1}, t_{k-1})\,dQ_k = \frac{dQ_k}{|A|^2}\left|\exp\left[\frac{im}{2\hbar}\frac{(x_k - x_{k-1})^2}{\Delta t}\right]\right|^2, \tag{9}$$

in which

$$\Delta t = t_k - t_{k-1}.$$
The product in (8) has the meaning of the probability that successive
measurements of coordinate $Q$ at instants

$$t_0 < t_1 < t_2 < \dots < t_k < \dots < t_N$$

will give respectively the values

$$Q_1 \pm \tfrac{1}{2}dQ_1,\ Q_2 \pm \tfrac{1}{2}dQ_2,\ \dots,\ Q_k \pm \tfrac{1}{2}dQ_k,\ \dots,\ Q_N \pm \tfrac{1}{2}dQ_N,$$

each such measurement being a macroscopic event. Hence the probability
of (8) is the probability of a certain sequence of macroscopic events that
serves to describe the history of an individual particle (Figure 12).

Fig. 12. Path of a microparticle as a series of macroscopic events: each circle denotes
a macroscopic phenomenon caused by the microparticle, e.g. a bubble in a bubble
chamber.

This chain of events could be extremely striking if each measurement
of $Q$ did not lead to a state of infinitely large energy $E$. In this case the
operator for the energy of the microparticle is

$$\hat{H} = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2},$$

while the wave function is

$$\psi(x, t) = \frac{1}{A}\exp\left[\frac{im}{2\hbar\,\Delta t}(x - x_0)^2\right], \tag{10}$$

in which $x_0$ is the coordinate derived from the previous measurement. The
mean energy $\bar{E}$ of the microparticle in state (10) is given by the usual
formulae of quantum mechanics as

$$\bar{E} = \frac{\int\psi^*\hat{H}\psi\,dx}{\int\psi^*\psi\,dx}. \tag{11}$$

These integrals are easily calculated if we replace the $m$ of (10) by $m + i\delta$
with $\delta > 0$, thereafter passing to the limit $\delta \to 0$, which gives

$$\bar{E} = \frac{\hbar^2}{8m}\,\frac{m^2}{\delta}\,\frac{1}{\hbar\,\Delta t} + \dots, \qquad \delta > 0, \tag{12}$$

so that $\bar{E}$ tends to $+\infty$ as $\delta \to +0$. Hence to produce the trajectory
described by (8) we must have unlimited energy from some source: either
in the measuring instrument $A + D$ or in the microparticle μ itself.

For this reason we shall now consider a more realistic scheme for the 
description of the trajectory of an individual particle, in which we do not 
assume an infinitely precise measurement of Q. 

We assume that the microparticle interacts with infinitely heavy atoms
whose disposition in space is known, thereby transferring them from the
ground state $\psi_0(x)$ to an excited state $\psi_n(x)$, in which $x$ is the coordinate
of an electron in the atom, and $\psi(x)$ is its wave function. The interaction
energy $W(x - Q)$ of our microparticle with an electron of the atom will be
taken as represented by a δ-function:

$$W(x - Q) = g\,\delta(x - Q), \tag{13}$$

in which $Q$ is the coordinate of the microparticle and $g$ is the coupling
constant. The atom receives an excitation energy $\varepsilon = E_n - E_0$, while the
energy of the microparticle is $E = P^2/2M$, $M$ being the mass of the
particle, whose momentum before the collision is $P$ and after the collision
$P'$.

We assume that $E \gg \varepsilon$, so that the microparticle itself is the source of
the high energy needed for its localization.

We suppose that the matrix element for the density of the electron in
the atom,

$$\rho_n(x) = \psi_n^*(x)\,\psi_0(x), \tag{14}$$

vanishes for $|x| > a$, so that $a$ behaves as the size of the atom. The Fourier
transform of this is

$$\tilde{\rho}_n(q) = \int \rho_n(x)\,e^{iqx}\,dx \tag{15}$$

and will vanish in the region $|q| \gg 1/a$.
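The statement about (15) can be illustrated numerically. The sketch below is purely illustrative: the true $\rho_n(x)$ depends on the atomic wave functions, so a hypothetical smooth model density $\cos^2(\pi x/2a)$ confined to $|x| \le a$ is assumed instead, and the Fourier integral is approximated by a simple sum on a grid.

```python
import numpy as np

a = 1.0                                    # hypothetical size of the atom
x = np.linspace(-a, a, 4001)               # the density is taken as zero outside |x| <= a
dx = x[1] - x[0]
rho = np.cos(np.pi * x / (2 * a)) ** 2     # assumed model density, vanishing at |x| = a

def rho_tilde(q):
    """Riemann-sum approximation to the Fourier transform (15) at wave number q."""
    return np.sum(rho * np.exp(1j * q * x)) * dx

reference = abs(rho_tilde(0.0))
ratio_low = abs(rho_tilde(0.5 / a)) / reference     # |q| below 1/a
ratio_high = abs(rho_tilde(20.0 / a)) / reference   # |q| much larger than 1/a
```

For this model the transform at $q = 20/a$ is smaller than its value at $q = 0$ by about three orders of magnitude, while at $q = 0.5/a$ it is almost unchanged; “vanish” in the text is therefore to be read as “become negligibly small”.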

For simplicity we shall also assume that only one matrix element $\rho_n(x)$
is of any importance, all the others being negligibly small.

The role of the analyser $A$ is played by the excitation $E_0 \to E_n$ of the
atom localized in space; this sorts the microparticles in accordance with
their $Q$ coordinates with an error of the order of $a$, the size of the atom.
This excited atom may subsequently produce a latent-image state in a 
grain of a photographic emulsion, local evaporation in a bubble chamber, 
condensation in a Wilson cloud chamber, and so on (all macroscopic 
effects). These or other processes act as the detector D that destroys the 
interference between the various states of the microparticle. 

The mode of operation of the detectors will not be discussed; we 
consider only the primary function of part A, which performs the analysis 
of the coordinates. The analyser A (here a heavy atom capable of being 
excited) is macroscopic because the mass of the atom has been assumed to 
be infinitely large, so that the position of its centre of gravity in space is 
completely defined. 

The wave function of the system $\mu + A$ at the initial instant $t = 0$ is

$$\Phi_0(x, Q) = \psi_0(x)\exp\left(\frac{iPQ}{\hbar}\right). \tag{16}$$

For $t > 0$ this function is put as

$$\Phi(x, Q, t) = \Phi_0(x, Q)\,e^{-i\omega_0 t} + \varphi(x, Q, t), \tag{17}$$

in which $\omega_0 = (E + E_0)/\hbar$; $\varphi(x, Q, t)$ will be taken as small and calculated
via perturbation theory. The method used in Chapter XI gives us from
(XI-14) that


twot 


0.8 0,)=25= [pn(x) exp] :(P~P% )]3’~0) 
ee gos ae i 
——<— d8p'-dx' 40-4 (3) exp, or'|, (18) 
h 
Zz i 
Q=— = (Ep + Eo — Ep — E, = Ep — Ep — 8). (19) 


We assume that $E_P \gg \varepsilon$, so that we have
= = (Er Ep) = Qpp. 


Consider now $\varphi(x, Q, t)$ for


PP’ 


i.e. immediately after a possible collision of the microparticle with the 
atom. We have from (18) that 


eivot 


oa 
ats) | pala’) eS +32 a, (20) 


— 


g x,Q, tat 


in which q=P’ —P, or 


twot 


a 


SOS he oO) exp] |= Wal) $9(Q). 20 


In other words, the state of the microparticle (which produces the
excitation $E_0 \to E_n$) is represented by the following wave function for small $t$:

$$\varphi_n(Q) = \rho_n(Q)\exp\left(\frac{iPQ}{\hbar}\right). \tag{21}$$

This function is a wave packet with a probability distribution for the
coordinate of the microparticle:

$$W_n(Q) = |\rho_n(Q)|^2, \tag{22}$$

the initial average momentum being

$$P = -i\hbar\int \varphi_n^*(Q)\,\nabla_Q\,\varphi_n(Q)\,dQ. \tag{23}$$

This is so to within the momentum of the electron in the atom, which is
of the order $\hbar/a \ll P$.


Fig. 13. Tracks of π⁻ mesons with an energy of 340 MeV in a hydrogen bubble
chamber (recorded in the Nuclear Problems Laboratory, Joint Nuclear Research
Institute). The π⁻ meson denoted by the arrow interacts with a proton p via the reaction
π⁻ + p → n + π⁻ + π⁺ (n is a neutron), the π⁻ meson being scattered downwards to
the left, while the π⁺ goes upwards and soon decays via π⁺ → μ⁺ + ν (ν is a neutrino).
The μ⁺ meson in turn decays via μ⁺ → e⁺ + ν + ν̄. The top strongly curved track is
that of the positron e⁺.
Ionization of the atom is thus accompanied by localization of the
microparticle in the wave packet of (21), so that ionization may be
considered as transition of the particle from the state $\exp(iPQ_0/\hbar)$ at $t_0 = 0$ to
the state $\varphi_n(Q_1)$ at $t = t_1$ (the notation is the same as that employed
previously to characterize points on the locus of the microparticle). The
packet of (21) for $t \gg 1/\Omega_{PP'}$ spreads out until the next ionization at time
$t_2$; for

$$t - t_2 \ll \frac{1}{\Omega_{P'P}}$$

we again get a packet of the form of (21), and so on.

In this example the trajectory is described by a series of excitations of 
heavy atoms A, which lead to localizations of the microparticle, with 
subsequent initiation of processes in the detector D. 

The example is highly schematic, but its significance is very general: 
the individual history of a particle is expressed as a sequence of macro- 
scopic events. 

Figure 13 shows the history of a π meson which collides with a proton
and so generates two new π mesons. The meson tracks are formed by
series of bubbles in liquid hydrogen. Could we not hope for a more
detailed description of the history of a single microparticle?

Might we in such a description avoid the chain of macroscopic events, 
i.e. the language of microcosmical catastrophes in which the micro- 
particle tells us its history? For each bubble in the bubble chamber reflects 
a catastrophe in the microcosm. 

It may seem that there is no hope in the search for such a more detailed
description of the history of a microparticle; in fact, we cannot point to a
single experimental fact that would indicate that quantum mechanics is
incomplete within the range of atomic phenomena of the microcosm.

But, all the same, we should be cautious and remember the quotation 
from Prutkov: Who interferes with the seeker for unwettable gunpowder? 